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SEQUENCE LISTING 

<110> Yocum, R. et al, 

<120> MICROORGANISMS AND ASSAYS FOR THE IDENTIFICATION OF 
ANTIBIOTICS 

<130> OGZ-001 

<140> 
<141> 

<150> US 60/227, 860 
<151> 2000-08-24 



<160> 76 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 777 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (774) 

<4 00> 1 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 48 

Leu Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 

15 10 15 

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 96 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg lie Glu %hr Ser Arg 

20 25 30 

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tec tta ttt gat 144 
His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 192 
His Ser Gly Leu Met Phe Glu Gin lie Asp Gly lie lie lie Ser Ser 
50 55 60 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 240 
Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288 
Phe His lie Glu Pro Gin lie Val Gly Pro Gly Met Lys Thr Gly Leu 

85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 336 
Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 

100 105 110 



aat get gtc get gcg ata cac ttg tac ggc aat cca tta att gtt gtc 



384 



f 



OGZ-001 



-2- 



Asn Ala Val Ala Ala lie His Leu Tyr Gly Asn Pro Leu lie Val Val 
115 120 125 



gat ttc gga acc gcc aca acg tac tgc tat att gat gaa aac aaa caa 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asp Glu Asn Lys Gin 
130 135 140 



432 



tac atg ggc ggg gcg att gcc cct ggg att aca att teg aca 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He' Ser Thr 
145 150 155 



gag gcg 
Glu Ala 
160 



480 



ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate acc 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr 

165 170 



egg ccc 
Arg Pro 
175 



528 



gac aat att ate gga aaa aac act gtt age gcg atg caa tct 
Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser 

180 185 190 



gga 
Gly 



att 
He 



576 



tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 
Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 



624 



tgg cag gca aaa cag gac etc aag gtc att gcg aca gga ggc ctg gcg 
Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 - 220 



672 



ccg etc att gcg aac gaa tea gat tgt ata gac ate gtt gat cca ttc 
Pro Leu He Ala Asn Glu Ser Asp Cys He Asp lie Val Asp Pro Phe 
225 230 235 240 



720 



tta acc eta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 
Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 

245 250 255 



768 



agt gta tag 
Ser Val 



777 



<210> 2 
<211> 258 
<212> PRT 

<213> Bacillus subtilis 
<400> 2 

Leu Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 

20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin lie Asp Gly He lie lie Ser Ser 
50 55 60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 



Phe His lie Glu Pro Gin lie Val Gly Pro Gly Met Lys Thr Gly Leu 
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Asn lie Lys Tyr 

100 

Asn Ala Val Ala 
115 

Asp Phe Gly Thr 
130 

Tyr Met Gly Gly 
145 

Leu Tyr Ser Arg 



Asp Asn lie lie 

180 

Leu Phe Gly Tyr 
195 

Trp Gin Ala Lys 
210 

Pro Leu lie Ala 
225 

Leu Thr Leu Lys 



Ser Val 



Asp Asn Pro Lys 



Ala lie His Leu 

120 

Ala Thr Thr Tyr 
135 

Ala lie Ala Pro 
150 

Ala Ala Lys Leu 
165 

Gly Lys Asn Thr 



Val Gly Gin Val 

200 

Gin Asp Leu Lys 
215 

Asn Glu Ser Asp 
230 • 

Gly Leu Glu Leu 



Glu Val Gly Ala 
105 

Tyr Gly Asn Pro 



Cys Tyr lie Asp 

140 

Gly lie Thr lie 
155 

Pro Arg lie Glu 
170 

Val Ser Ala Met 
185 

Glu Gly lie Val 



Val lie Ala Thr 

220 

Cys lie Asp lie 
235 

lie Tyr Glu Arg 
250 



Asp Arg lie Val 
110 

Leu lie Val Val 
125 

Glu Asn Lys Gin 



Ser Thr Glu Ala 

160 

lie Thr Arg Pro 
175 

Gin Ser Gly lie 
190 

Lys Arg Met Lys 
205 

Gly Gly Leu Ala 



Val Asp Pro Phe 

240 

Asn Arg Val Gly 
255 



<210> 3 
<211> 250 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 3 

Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu 
15 10 15 

Lys Val lie Leu Val Leu Asp Val Gly Asn Thr Asn lie Val Leu Gly 

20 25 30 

lie Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp 
35 40 45 

Val Leu Arg Ser Ala Asp Glu Tyr Gly lie Gin Val Met Asn Leu Phe 
50 55 60 

Gin Gin Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val lie lie Ser 
65 70 75 80 

Ser Val Val Pro Asn lie Met Tyr Ser Leu Glu His Met lie Arg Lys 

85 90 95 



Tyr Phe Lys lie Asn Pro Leu Val Val Gly Pro Gly- lie Lys Thr Gly 
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100 

lie Asn lie Lys 
115 

Val Asn Ala Val 
130 

lie Asp Phe Gly 
145 

Asp Tyr Leu Gly 



Ala Leu Phe Glu 

180 

Pro Ala Tyr Ala 
195 

lie Val Tyr Arg 
210 

Lys Glu Asn Leu 
225 

Ala Thr Gly Gly 



Tyr Asp Asn Pro 

120 

Ala Ala His Glu 
135 

Thr Ala Thr Thr 
150 

Gly Ala lie Cys 
165 

Lys Ala Ala Lys 



lie Cys Lys Asn 

200 

Tyr Leu Arg Gin 
215 

Pro Asp Gly Arg 
230 

Leu Ala Lys Leu 
245 



105 

Lys Glu Val Gly 



lie Tyr Lys Arg 

140 

Phe Cys Ala Val 
155 

Pro Gly lie Lys 
170 

Leu Pro Arg Val 
185 

Thr lie Ser Ser 



Val Lys Tyr Leu 

220 

Arg Thr Arg Thr 
235 

lie Asn 
250 



110 

Ala Asp Arg lie 
125 

Ser Leu lie lie 



Arg Glu Asn Gly 

160 

Val Ser Ser Glu 
175 

Glu Leu lie Lys 
190 

lie Gin Ser Gly 
205 

Phe Glu Lys Leu 



Ser Leu Val Leu 

240 



<210> 4 
<211> 2 65 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 4 

Met Leu Leu Thr lie Asp Val Gly Asn Thr His Thr Val Leu Gly Leu 
15 10 15 

Phe Asp Gly Glu Asp lie Val Glu His Trp Arg He Ser Thr Asp Ser 

20 25 30 

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gin Gly Leu Met Gly 
35 40 45 

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp Gly He Asp Gly He 
50 55 60 

Ala He Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val 
65 70 75 80 

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val Glu Pro Gly 

85 90 95 

Val Lys Thr Gly Val Pro He Leu Thr Asp His Pro Lys Glu Val Gly 

100 105 110 

Ala Asp Arg He He Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly 
115 120 125 

Pro Ala He Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Ala Val 
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130 135 140 

Ser Ala Arg Gly Glu Tyr lie Gly Gly Val He Ala Pro Gly He Glu 
145 150 155 160 

He Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gin Leu Arg Lys He 

165 170 175 

Glu Val Ala Arg Pro Arg Ser Val He Gly Lys Asn Thr Val Glu Ala 

180 185 190 

Met Gin Ser Gly He Val Tyr Gly Phe Ala Gly Gin Val Asp Gly Val 
195 200 205 

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr 
210 215 220 

Val lie Ala Thr Gly Gly Leu Ala Pro Met Val Leu Gly Glu Ser Ser 
225 230 235 240 

Val He Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu 

245 250 255 

Val Tyr Glu Arg Asn Val Ser Arg Met 

260 265 



<210> 5 
<211> 272 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 5 

Met Leu Leu Ala He Asp Val Arg Asn Thr His Thr Val Val Gly Leu 
15 10 15 

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gin Gin Trp Arg lie 

20 25 30 

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr He Asp 
35 40 45 

Gly Leu He Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu 
50 55 60 

Ser Thr Val Pro Ser Val Leu His Glu Val Arg lie Met Leu Asp Gin 
65 70 75 80 

Tyr Trp Pro Ser Val Pro His Val Leu lie Glu Pro Gly Val Arg Thr 

85 90 95 

Gly He Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg 

100 105 110 

He Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala lie 
115 120 125 

Val Val Asp Phe Gly Ser Ser lie Cys Val Asp Val Val Ser Ala Lys 
130 135 140 

Gly Glu Phe Leu Gly Gly Ala lie Ala Pro Gly Val Gin Val Ser Ser 
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145 150 155 160 

Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val Glu Leu Ala 

165 170 175 

Arg Pro Arg Ser Val Val Gly Lys Asn Thr Val Glu Cys Met Gin Ala 

180 185 190 

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val Gly Arg 
195 200 205 

lie Arg Glu Asp Val Ser Gly Phe Ser Val Asp His Asp Val Ala lie 
210 215 220 

Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu Leu His Thr 
225 230 235 240 

Val Asp His Tyr Asp Gin His Leu Thr Leu Gin Gly Leu Arg Leu Val 

245 250 255 

Phe Glu Arg Asn Leu Glu Val Gin Arg Gly Arg Leu Lys Thr Ala Arg 

260 265 270 



<210> 6 
<211> 258 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 6 

Met Leu Leu Cys lie Asp Cys Gly Asn Thr Asn Thr Val Phe Ser Val 
1 -5 10 15 

Trp Asp Gly Thr Asp Phe Ala Ala Thr Trp Arg lie Ala Thr Asp His 

20 25 30 

Arg Arg Thr Ala Asp Glu Tyr Phe Val Trp Leu Asn Thr Leu Met Gin 
35 ' 40 45 

Leu Lys Gly Leu Gin Gly Arg lie Ser Glu Ala lie lie Ser Ser Thr 
50 55 60 

Ala Pro Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe 
65 70 75 80 

Asp Cys Arg Pro Tyr Val Val Gly* Lys Pro Gly Cys Glu Leu Pro Val 

85 90 95 

Ala Pro Arg Val Asp Pro Gly Thr Thr Val Gly Pro Asp Arg Leu Val 

100 105 110 

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu lie Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp Gly Ala 
130 135 140 

Tyr lie Gly Gly Val lie Ala Pro Gly Val Asn Leu Ser Leu Glu Ala 
145 150 155 160 



Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro 
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165 170 175 

Gin Gly Val lie Gly Thr Asn Thr Val Ala Cys lie Gin Ser Gly Val 

180 185 190 

Tyr Trp Gly Tyr lie Gly Leu Val Glu Gly lie Val Arg Gin lie Arg 
195 200 205 

Met Glu Arg Asp Arg Pro Met Lys Val lie Ala Thr Gly Gly Leu Ala 
210 215 220 

Ser Leu Phe Asp Leu Gly Phe Asp Leu Phe Asp Lys Val Glu Asp Asp 
225 230 235 240 

Leu Thr Met His Gly Leu Arg. Leu lie Phe Asp Tyr Asn Lys Gly Leu 

245 250 255 

Gly Ala 



<210> 7 
<211> 255 
<212> PRT 

<213> Geobacter sulf urreducens 
<400> 7 

Met Leu Leu Val lie Asp Val Gly Asn Thr Asn lie Val Leu Gly lie 
15 10 15 

Tyr Asp Gly Glu Arg Leu Val Arg Asp Trp Arg Val Ser Thr Asp Lys 

20 25 30 

Ala Arg Thr Thr Asp Glu Tyr Gly lie Leu lie Asn Glu Leu Phe Arg 
35 40 45 

Leu Ala Gly Leu Gly Leu Asp Gin lie Arg Ala Val lie lie Ser Ser 
50 55 60 

Val Val Pro Pro Leu Thr Gly Val Leu Glu Arg Leu Ser Leu Gly Tyr 
65 70 75 80 

Phe Gly Met Arg Pro Leu Val Val Gly Pro Gly lie Lys Thr Gly Met 

85 90 95 

Pro lie Gin Tyr Asp Asn Pro Arg Glu Val Gly Ala Asp Arg lie Val 

100 105 110 

Asn Ala Val Ala Gly Tyr Glu Lys Tyr Arg Thr Ser Leu lie lie Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Tyr Val Asn Arg Lys Gly Glu 
130 135 140 

Tyr Cys Gly Gly Ala lie Ala Pro Gly Leu Val lie Ser Thr Glu Ala 
145 150 155 160 

Leu Phe Gin Arg Ala Ser Lys Leu Pro Arg Val Asp lie lie Arg Pro 

165 170 175 

Ser Ala lie lie Ala Arg Asn Thr Val Asn Ser Met Gin Ala Gly lie 
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180 

Tyr Tyr Gly Tyr 
195 

Ala Glu Ser Lys 
210 

Ser Leu lie Ala 
225 

Leu Thr Leu Glu 



Val Gly Leu Val 

200 

Asp Ala Pro Arg 
215 

Pro Glu Ser Lys 
230 

Gly Leu Arg lie 



185 

Asp Glu lie Val 



Val lie Ala Thr 

220 

Thr lie Glu Ala 
235 

Leu Tyr Glu Arg 
250 



190 

Thr Arg Met Lys 
205 

Gly Gly Leu Ala 



Val Glu Glu Tyr 

240 

Asn Arg Glu 
255 



<210> 8 
<211> 262 
<212> PRT 

<213> Deinococcus radiopugnans 
<400> 8 

Met Pro Ala Phe Pro Leu Leu Ala Val Asp lie Gly Asn Thr Thr Thr 
15 10 15 

Val Leu Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg 

20 25 30 

lie Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu Gin Leu 
35 40 45 

His Gly Leu Phe Thr Leu Ala Gly Ala Pro lie Pro Arg Ala Ala Val 
50 55 60 

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu 
65 70 75 80 

Lys Arg His Phe Met lie Asp Ala Phe Ala Val Ser Ala Glu Asn Leu 

85 90 95 

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp 

100 105 110 

Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp 
115 120 125 

Tyr Ala Val Val Val Asp Phe Gly Thr Ser Thr Asn Phe Asp Val Val 
130 135 140 

Gly Arg Gly Arg Arg Phe Leu Gly Gly lie Leu Ala Thr Gly Ala Gin 
145 150 155 160 

Val Ser Ala Asp Ala Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg lie 

165 170 175 

Thr Leu Gin Ala Pro Glu Thr Ala lie Gly Lys Asn Thr Val His Ala 

180 185 190 

Leu Gin Ser Gly Leu Val Phe Gly Tyr Ala Glu Met Val Asp Gly Leu 
195 200 205 



Leu Arg Arg lie Arg Ala Glu Leu Pro Gly Glu Ala Val Ala Val Ala 
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210 215 220 

Thr Gly Gly Phe Ser Arg Thr Val Gin Gly lie Cys Gin Glu lie Asp 
225 230 235 240 

Tyr Tyr Asp Glu Thr Leu Thr Leu Arg Gly Leu Val Glu Leu Trp Ala 

245 250 255 

Ser Arg Ser Glu Val Arg 

260 



<210> 9 
<211> 246 
<212> PRT 

<213> Thermotoga maritima 
<400> 9 

Met Tyr Leu Leu Val Asp Val Gly Asn Thr His Ser Val Phe Ser lie 
15 10 15 

Thr Glu Asp Gly Lys Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val 

20 25 30 

Phe Gin Thr Glu Asp Glu Leu Phe Ser His Leu His Pro Leu Leu Gly 
35 40 45 

Asp Ala Met Arg Glu lie Lys Gly lie Gly Val Ala Ser Val Val Pro 
50 55 60 

Thr Gin Asn Thr Val lie Glu Arg Phe Ser Gin Lys Tyr Phe His lie 
65 70 75 80 

Ser Pro lie Trp Val Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val 

85 90 95 

Lys Asn Pro Ser Glu Val Gly Ala Asp Arg Val Ala Asn Val Val Ala 

100 105 110 

Phe Val Lys Glu Tyr Gly Lys Asn Gly lie lie lie Asp Met Gly Thr 
115 120 125 

Ala Thr Thr Val Asp Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala 
130 135 140 

lie Leu Pro Gly Phe Phe Met Met Val His Ser Leu Phe Arg Gly Thr 
145 150 155 160 

Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly 

165 170 175 

Lys Asp Thr Glu' Glu Asn lie Arg Leu Gly Val Val Asn Gly Ser Val 

180 185* 190 

Tyr Ala Leu Glu Gly. lie lie Gly Arg He Lys Glu Val Tyr Gly Asp 
195 200 205 

Leu Pro Val Val Leu Thr Gly Gly Gin Ser Lys He Val Lys Asp Met 
210 215 220 



He Lys His Glu He Phe Asp Glu Asp Leu Thr He Lys Gly Val Tyr 
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225 230 235 240 

His Phe Cys Phe Gly Asp 

245 



<210> 10 
<211> 273 
<212> PRT 
<213> Treponema 

<400> 10 
Met Leu Leu lie 
1 

Gly Glu Asn Gly 

20 

Pro Asp Ala Arg 
35 

Leu Cys Glu Arg 
50 

lie Ser Ser Val 
65 

Ala Gin lie Ser 



Glu His Leu Pro 

100 

Thr Asp Leu Val 
115 

Ala Cys Val Val 
130 

Asp Gly Thr Gly 
145 

Thr Ala Val Gin 



Pro Leu Ala Leu 

180 

Val Gin Ala Gly 
195 

lie Ala Gin Cys 
210 

Thr Gly Gly Leu 
225 

lie Asp Ala Gin 



Val Pro Thr Ser 



pallidum 



Asp Val Gly Asn 
5 

Gly Arg Val Cys 



Lys Thr Gin Asp 

40 

Ala Gly Val Gly 
55 

Val Pro Val Leu 
70 

Gly Val Gin Pro 
85 

Val Arg lie Pro 



Ala Asn Ala Val 

120 

Val Asp Cys Gly 
135 

Leu lie Gin Gly 
150 

Ser Leu His Thr 
165 

Pro Asp Ser Val 



Val Val Arg Gly 

200 

Gin Lys Glu Leu 
215 

Ser Arg Leu Phe 
230 

Leu Thr Leu Ser 
245 

Leu Leu Pro Pro 



Ser His Val Val 
10 

Val Arg Glu Leu 
25 

Glu Tyr Ser Leu 



Arg Ala Ser Leu 

60 

Thr Lys Thr lie 
75 

Val Val Phe Gly 
90 

Glu Pro Val Arg 
105 

Ala Ala Tyr Val 



Thr Ala Leu Thr 

140 

Val Ala lie Ala 
155 

Gly Thr Ala Gin 
170 

Leu Gly Lys Asp 
185 

Thr Leu Phe Val 



Gly Cys Arg Cys 

220 

Ser Ser Glu Val 
235 

Gly Leu Ala His 
250 

Ala Thr Val Ser 



Phe Gly lie Gin 
15 

Phe Arg Leu Ala 
30 

Leu lie His Ala 
45 

Arg Asp Ala Phe 



Ala Asp Ala Val 

80 

Pro Trp Ala Tyr 
95 

Ala Glu lie Gly 
110 

His Phe Arg Ser 
125 

Phe Thr Ala Val 



Pro Gly Leu Arg 

160 

Leu Pro Leu Val 
175 

Thr Thr His Ala 
190 

lie Arg Ala Met 
205 

Ala Ala Val lie 



Asp Phe Pro Pro 

240 

lie Ala Arg Leu 
255 

Gly Ser Ser Gly 
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260 265 270 

Asn 



<210> 11 
<211> 262 
<212> PRT 

<213> Borrelia burgdorferi 
<400> 11 

Met Asn Lys Pro Leu Leu Ser Glu Leu lie lie Asp lie Gly Asn Thr 
1 5 10 15 

Ser lie Ala Phe Ala Leu Phe Lys Asp Asn Gin Val Asn Leu Phe lie 

20 25 30 

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe 
35 40 45 

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe lie Ser Ser 
50 55 60 

Val Val Pro lie Leu Asn Glu Thr Phe Lys Asn Val lie Phe Ser Phe 
65 70 75 80 

Phe Lys lie Lys Pro Leu Phe lie Gly Phe Asp Leu Asn Tyr Asp Leu 

85 90 95 

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val 

100 105 110 

Phe Ala Asn Leu Val Ala Ala lie Glu Asn Tyr Ser Phe Glu Asn Val 
115 120 125 

Leu Val Val Asp Leu Gly Thr Ala Cys Thr lie Phe Ala Val Ser Arg 
130 135 140 

Gin Asp Gly lie Leu Gly Gly lie lie Asn Ser Gly Pro Leu lie Asn 
145 150 155 160 

Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu lie Lys Lys Phe Pro lie 

165 170 175 

Ser Thr Pro Asn Asn Leu Leu Glu Arg Thr Thr Ser Gly Ser Val Asn 

180 185 190 

Ser Gly Leu Phe Tyr Gin Tyr Lys Tyr Leu He Glu Gly Val Tyr Arg 
195 200 205 

Asp He Lys Gin Met Tyr Lys Lys Lys Phe Asn Leu He He Thr Gly 
210 215 220 

Gly Asn Ala Asp Leu He Leu Ser Leu He Glu He Glu Phe lie Phe 
225 230 235 240 

Asn lie His Leu Thr Val Glu Gly Val Arg lie Leu Gly Asn Ser lie 

245 250 255 



Asp Phe Lys Phe Val Asn 
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260 



<210> 12 
<211> 229 
<212> PRT 

<213> Aquifex aeolicus 
<400> 12 

Met Arg Phe Leu Thr Val Asp Val Gly Asn Ser Ser Val Asp lie Ala 
15 10 15 

Leu Trp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His Glu 

20 25 30 

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu Gly lie Ser 
35 40 45 

Val Lys Gin Ser Phe Ser Glu Lys Val Arg Gly Lys lie Pro Lys lie 
50 55 60 

Lys Phe Leu Lys Lys Glu Asn Phe Pro lie Gin Val Asp Tyr Lys Thr 
65 70 75 80 

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Lys 

85 90 95 

Lys Phe Tyr Gly Lys Asn Val Val Val lie Ser Ala Gly Thr Ala Leu 

100 105 110 

Val lie Asp Leu Val Leu Glu Gly Lys Phe Lys Gly Gly Phe lie Thr 
115 120 125 

Leu Gly Leu Gly Lys Lys Leu Lys lie Leu Ser Asp Leu Ala Glu Gly 
130 135 140 

lie Pro Glu Phe Phe Pro Glu Glu Val Glu lie Phe Leu Gly Arg Ser 
145 150 155 160 

Thr Arg Glu Cys Val Leu Gly Gly Ala Tyr Arg Glu Ser Thr Glu Phe 

165 170 175 

lie Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe Lys Arg Lys Phe 

180 185 190 

Lys Val Val lie Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly 
195 200 205 

lie Tyr Asp Pro Leu Leu Val His Arg Gly Met Arg Asn Leu Leu Tyr 
210 215 220 

Leu Tyr His Arg lie 
225 



<210> 13 
<211> 257 
<212> PRT 

<213> Synechocystis sp. 



<400> 13 
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Met Glu Thr Ser Lys Pro Gly Cys Gly Leu Ala Leu Asp Asn Asp Lys 
15 10 15 

Gin Lys Pro Trp Leu Gly Leu Met lie Gly Asn Ser Arg Leu His Trp 

20 25 30 

Ala Tyr Cys Ser Gly Asn Ala Pro Leu Gin Thr Trp Val Thr Asp Tyr 
35 40 45 

Asn Pro Lys Ser Ala Gin Leu Pro Val Leu Leu Gly Lys Val Pro Leu 
50 55 60 

Met Leu Ala Ser Val Val Pro Glu Gin Thr Glu Val Trp Arg Val Tyr 
65 70 75 80 

Gin Pro Lys lie Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr 

85 90 95 

Pro Ser Phe Gly lie Asp Arg Ala Leu Ala Gly Leu Gly Thr Gly Leu 

100 105 110 

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr 
115 120" 125 

lie Thr Gly Phe Asp Gin Asp Lys Lys Leu Val Gly Gly Ala lie Leu 
130 135 140 

Pro Gly Leu Gly Leu Gin Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala 
145 150 155 160 

Leu Pro Lys Leu Glu Met Asp Gin Leu Thr Glu Leu Pro Asp Arg Trp 

165 170 175 

Ala Leu Asp Thr Pro Ser Ala lie Phe Ser Gly Val Val Tyr Gly Val 

180 185 190 

Leu Gly Ala Leu Gin Ser Tyr Leu Gin Asp Trp Gin Lys Leu Phe Pro 
195 200 205 

Gly Ala Ala Met Val lie Thr Gly Gly Asp Gly Lys lie Leu His Gly 
210 215 220 

Phe Leu Lys Glu His Ser Pro Asn Leu Ser Val Ala Trp Asp Asp Asn 
225 230 235 240 

Leu lie Phe Leu Gly Met Ala Ala lie His His Gly Asp Arg Pro lie 

245 250 255 

Cys 



<210> 14 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 



<400> 14 

Met Pro Ala Arg Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys 
15 10 15 
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Asp lie Gly Asn 

20 

Ser Ser Ala Lys 
35 

Phe Tyr lie Ser 
50 

Tyr Pro Asn Ala 
65 

Tyr Val Gly Leu 



Asn Gly Val Val 

100 

Lys Glu Gly Lys 
115 

Tyr lie His Ala 
130 

Lys Ala Leu Asp 
145 

Val Asn Tyr Gly 



Ala Lys Asn Gin 

180 

Ser Ala Phe Leu 
195 

Gly Met Glu lie 
210 



<210> 15 
<211> 267 
<212> PRT 
<213> Bordetell 

<400> 15 
Met lie lie Leu 
1 

Phe Asp Pro Asp 

20 

Phe Asp Asn Leu 
35 

Pro Arg Arg Pro 
50 

Arg Gly Glu Ala 
65 



Thr Arg lie His 



Glu Asp Leu Lys 

40 

Val Asn Glu Glu 
55 

Lys Asn lie Ala 
70 

Gly lie Asp Arg 
85 

Val Asp Ala Gly 



His Leu Gly Gly 

120 

Tyr Lys Lys Ser 
135 

Ser Leu Glu Val 
150 

Met Val Leu Ser 
165 

Lys lie Tyr Leu 



Pro His Ser Val 

200 

Ala Leu Lys Lys 
215 



pertussis 



lie Asp Ser Gly 
5 

Ala Pro Gin Ala 



Asp Leu Asp Ala 

40 

Gin Arg Ala Leu 
55 

lie Ala Ala Thr 
70 



Phe Ala Gin Asn 
Arg Leu Gly lie 



Asn Glu Lys Ala 

60 

Gly Phe Phe His 
75 

Gin Met Ala Cys 
90 

Ser Ala lie Thr 
105 

Cys lie Leu Pro 



Ala Lys lie Leu 

140 

Leu Pro Lys Ser 
155 

Val lie Ala Cys 
170 

Cys Gly Gly Asp 
185 

Cys Lys Glu Arg 



Ala Gly lie Leu 

220 



Asn Ser Arg Leu 
10 

Ala Arg Glu Pro 
25 

Leu Gly Arg Trp 



Gly Val Asn Val 

60 

Leu Arg Ala Gly 
75 



Tyr Gin Leu Phe 
30 

Gin Lys Glu lie 
45 

Leu Leu Asn Cys 



Leu Glu Thr Asp 

80 

Leu Ala Val Asn 
95 

lie Asp Leu lie 
110 

Gly Leu Ala Gin 
125 

Glu Gin Pro Phe 



Thr Arg Asp Ala 

160 

lie Gin His Leu 
175 

Ala Lys Tyr Leu 
190 

Leu Val Phe Asp 
205 

Glu Cys Lys 



Lys Val Gly Trp 
15 

Ala Pro Val Ala 
30 

Leu Ala Thr Leu 
45 

Ala Gly Leu Ala 



Gly Cys Asp lie 

80 
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7 

5 i 



Arg Trp Leu Arg Ala Gin Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr 

85 90 95 

Arg Asn Pro Asp Gin Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly 

100 105 110 

Val Leu Ala Arg Gin Pro Ser Val His Pro Pro Leu Leu Val Ala Ser 
115 120 125 

Phe Gly Thr Ala Thr Thr Leu Asp Thr lie Gly Pro Asp Asn Val Phe 
130 135 140 

Pro Gly Gly Leu lie Leu Pro Gly Pro Ala Met Met Arg Gly Ala Leu 
145 150 155 160 

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp 

165 170 175 

Tyr Pro lie Asp Thr His Gin Ala He Ala Ser Gly He Ala Ala Ala 

180 185 190 

Gin Ala Gly Ala He Val Arg Gin Trp Leu Ala Gly Arg Gin Arg Tyr 
195 200 205 

Gly Gin- Ala Pro Glu He Tyr Val Ala Gly Gly Gly Trp Pro Glu Val 
210 215 220 

Arg Gin Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly 
225 230 235 240 

Ala Thr Pro Gin Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu 

245 250 " 255 

Ala Ala Leu Ala Ala Gin Gly Ala Pro Thr Ala 

260 265 



<210> 16 
<211> 702 
<212> DNA 

<213> Bacillus subtilis 



<220> 
<221> 
<222> 



CDS 
(1) 



(699) 



<400> 16 
ttg tta ctg 
Met Leu Leu 
1 



gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 
Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
5 10 15 



48 



tat cat gat 
Tyr His Asp 



gga aaa 
Gly Lys 
20 



tta gaa tat cac tgg cgt ata gaa aca age agg 
Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 

25 30 



96 



cat 
His 



aaa 
Lys 



aca 
Thr 
35 



gaa gat 
Glu Asp 



gag ttt ggg atg att ttg cgc tec tta ttt gat 
Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
40 45 



144 



cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 
His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 



192 
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50 55 60 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 
Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 
Phe His lie Glu Pro Gin lie Val Gly Pro Gly Met Lys Thr Gly Leu 

85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 
Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 

100 105 110 

aat get gtc get gcg ata cac ttg tac ggc aat cca tta att gtt gtc 
Asn Ala Val Ala Ala lie His Leu Tyr Gly Asn Pro Leu lie Val Val 
115 120 125 

gat ttc gga acc gec aca acg tac tgc tat att gat gaa aac aaa caa 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

tac atg ggc ggg gcg att gec cct ggg att aca att teg aca gag gcg 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate. acc egg ccc 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 

165 170 175 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga att 
Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 

180 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 
Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

tgg cag gca aaa cag gac cca agg tea ttg cga cag gag gec tgg cgc 
Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 
210 215 220 

cgc tea ttg cga acg aat cag att gta tag 
Arg Ser Leu Arg Thr Asn Gin He Val 
225 230 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



702 



<210> 17 
<211> 233 
<212> PRT 

<213> Bacillus subtilis 
<400> 17 

Met Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 

20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 
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His Ser Gly Leu 
50 

Val Val Pro Pro 
65 

Phe His lie Glu 



Asn lie Lys Tyr 

100 

Asn Ala Val Ala 
115 

Asp Phe Gly Thr 
130 

Tyr Met Gly Gly 
145 

Leu Tyr Ser Arg 



Asp Asn lie lie 

180 



Met Phe Glu Gin 
55 

lie Met Phe Ala 
70 

Pro Gin lie Val 
85 

Asp Asn Pro Lys 



Ala lie His Leu 

120 

Ala Thr Thr Tyr 
135 

Ala lie Ala Pro 
150 

Ala Ala Lys Leu 
165 

Gly Lys Asn Thr 



lie Asp Gly lie 

60 

Leu Glu Arg Met 
75 

Gly Pro Gly Met 
90 

Glu Val Gly Ala 
105 

Tyr Gly Asn Pro 



Cys Tyr lie Asp 

140 

Gly lie Thr lie 
155 

Pro Arg lie Glu 
170 

Val Ser Ala Met 
185 



lie lie Ser Ser 



Cys Thr Lys Tyr 

80 

Lys Thr Gly Leu 
95 

Asp Arg lie Val 
110 

Leu lie Val Val 
125 

Glu Asn Lys Gin 



Ser Thr Glu Ala 

160 

lie Thr Arg Pro 
175 

Gin Ser Gly lie 
190 



Leu Phe Gly Tyr Val Gly Gin Val Glu Gly lie Val Lys Arg Met Lys 

195 200 205 

Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 

210 215 220 

Arg Ser Leu Arg Thr Asn Gin lie Val 
225 230 



<2i:o> 18 

<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (113) . . (118) 

<220> 

<221> -10_signal 
<222> (136) . . (141) 

<400> 18 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 
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<210> 19 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (136) . . (141) 

<220> 

<221> -10_signal 
<222> (159) . . (164 ) 

<400> 19 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 

tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 

ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 

gcaaccccgc ctgt * 194 



<210> 20 
<2il> 248 
<212> PRT 

<213> Pseudomonas aeruginosa 
<400> 20 

Met lie Leu Glu Leu Asp Cys Gly Asn Ser Leu lie Lys Trp Arg Val 
15 10 15 

lie Glu Gly Ala Ala Arg Ser Val Ala Gly Gly Leu Ala Glu Ser Asp 

20 25 30 

Asp Ala Leu Val Glu Gin Leu Thr Ser Gin Gin Ala Leu Pro Val Arg 
35 40 45 

Ala Cys Arg Leu Val Ser Val Arg Ser Glu Gin Glu Thr Ser Gin Leu 
50 55 60 

Val Ala Arg Leu Glu Gin Leu Phe Pro Val Ser Ala Leu Val Ala Ser 
65 70 75 80 

Ser Gly Lys Gin Leu Ala Gly Val Arg Asn Gly Tyr Leu Asp Tyr Gin 

85 90 95 

Arg Leu Gly Leu Asp Arg Trp Leu Ala Leu Val Ala Ala His His Leu 

100 105 110 

Ala Lys Lys Ala Cys Leu Val lie Asp Leu Gly Thr Ala Val Thr Ser 
115 120 125 

Asp Leu Val Ala Ala Asp Gly Val His Leu Gly Gly Tyr lie Cys Pro 
130 135 140 
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Gly Met Thr Leu Met Arg 
145 150 

Arg Tyr Asp Asp Ala Glu 

165 

Gly Gin Ala Thr Ala Glu 

180 

Arg Gly Phe Val Arg Glu 
195 

Pro Asp Cys Glu lie Phe 
210 

Asp Glu Leu Ala Gly Ala 
225 230 

Leu Ala Leu Ala Cys Pro 

245 



Ser Gin Leu Arg Thr 

155 

Ala Arg Arg Ala Leu 

170 

Ala Val Glu Arg Gly 
185 

Gin Tyr Ala Met Ala 
200 

Leu Thr Gly Gly Asp 
215 

Arg lie Met Pro Asp 

235 

lie Glu 



His Thr Arg Arg lie 

160 

Ala Ser Leu Gin Pro 

175 

Cys Leu Leu Met Leu 
190 

Cys Glu Leu Leu Gly 
205 

Ala Glu Leu Val Arg 
220 

Leu Val Phe Val Gly 

240 



<210> 21 
<211> 209 
<212> PRT 

<213> Campylobacter jejuni 
<400> 21 

Met Leu Leu Cys Asp lie Gly Asn Ser Asn Ala Asn Phe Leu Asp Asp 
15 10 15 

Asn Lys Tyr Phe Thr Leu Asn lie Asp Gin Phe Leu Glu Phe Lys Asn 

20 25 30 

Glu Gin Lys lie Phe Tyr lie Asn Val Asn Glu His Leu Lys Glu His 
35 40 45 

Leu Lys Asn Gin Lys Asn Phe lie Asn Leu Glu Pro Tyr Phe Leu Phe 
50 " 55 60 

Asp Thr lie Tyr Gin Gly Leu Gly lie Asp Arg lie Ala Ala Cys Tyr 
65 70 75 80 

Thr lie Glu Asp Gly Val Val Val Asp Ala Gly Ser Ala lie Thr lie 

85 90 95 

Asp lie lie Ser Asn Ser lie His Leu Gly Gly Phe lie Leu Pro Gly 

100 105 110 

lie Ala Asn Tyr Lys Lys lie Tyr Ser His lie Ser Pro Arg Leu Lys 
115 120 125 

Ser Glu Phe Asn Thr Gin Val Ser Leu Asp Ala Phe Pro Gin Lys Thr 
130 135 140 

Met Asp Ala Leu Ser Tyr Gly Val Phe Lys Gly lie Tyr Leu Leu lie 
145 150 155 160 

Lys Asp Ala Ala Gin Asn Lys Lys Leu Tyr Phe Thr Gly Gly Asp Gly 

165 170 175 
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Gln Phe Leu Ala Asn Tyr Phe Asp His Ala lie Tyr Asp Lys Leu Leu 

180 185 190 

lie Phe Arg Gly Met Lys Lys lie lie Lys Glu Asn Pro Asn Leu Leu 
195 200 205 



Tyr 



<210> 22 
<211> 592 
<212> PRT 
<213> Neisseria 

<400> 22 
Met Thr Val Leu 
1 

Asp Gly Leu Pro 

20 

Lys Pro Gin Gin 
35 

Arg Gly Leu Leu 
50 

Leu Ala Val Phe 
65 

Gly Phe Gin Thr 



lie Leu Glu Leu 

100 

Cys Val Thr His 
115 

Trp Ser His Arg 
130 

Phe Asp Arg Pro 
145 

Val Ala Cys Arg 



Lys Trp Pro Asn 

180 

Leu lie Glu Thr 
195 

lie Gly lie Asn 
210 



meningitidis 



Lys Pro Ser His 
5 

Gin His Val Ser 



Leu Asn Gly Phe 

40 

Arg Gin His Asp 
55 

Asp Ala Glu Gly 
70 

Ala Leu Lys His 
85 

Ala Arg lie Ala 



Leu Gin Ser Lys 

120 

Leu Gly Glu Cys 
135 

Gin Tyr Glu Leu 
150 

Arg Ala Leu Ser 
165 

Asp Leu Val Val 



Val Arg Thr Gly 

200 

Phe Val Leu Pro 
215 



Trp Arg Val Leu 
10 

Gin Leu Ala Arg 
25 

Trp Gin Gin Met 



Gly Tyr Trp Arg 

60 

Leu Arg Glu Leu 
75 

Glu Cys Ala Ser 
90 

Pro Asp Lys Ala 
105 

Gly Arg Gly Arg 



Leu Met Phe Ser 

140 

Gly Ser Leu Ser 
155 

Arg Leu Gly Leu 
170 

Gly Arg Asp Lys 
185 

Gly Lys Thr Val 



Lys Glu Val Glu 

220 



Ala Glu Leu Ala 
15 

Met Ala Asp Met 
30 

Pro Ala His lie 
45 

Leu Val Arg Pro 



Gly Glu Arg Ser 

80 

Ser Asn Asp Glu 
95 

His Lys Thr lie 
110 

Gin Gly Arg Lys 
125 

Phe Gly Trp Val 



Pro Val Ala Ala 

160 

Lys Thr Gin lie 
175 

Leu Gly Gly lie 
190 

Ala Val Val Gly 
205 

Asn Ala Ala Ser 



Val Gin Ser Leu Phe Gin Thr Ala Ser Arg Arg Gly Asn Ala Asp Ala 
225 230 235 240 
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Ala Val Leu Leu Glu Thr Leu Leu Ala Glu Leu Asp Ala Val Leu Leu 

245 250 255 

Gin Tyr Ala Arg Asp Gly Phe Ala Pro Phe Val Ala Glu Tyr Gin Ala 

260 265 270 

Ala Asn Arg Asp His Gly Lys Ala Val Leu Leu Leu Arg Asp Gly Glu 
275 280 285 

Thr Val Phe Glu Gly Thr Val Lys Gly Val Asp Gly Gin Gly Val Leu 
290 295 300 

His Leu Glu Thr Ala Glu Gly Lys Gin Thr Val Val Ser Gly Glu He 
305 310 315 320 

Ser Leu Arg Ser Asp Asp Arg Pro Val Ser Val Pro Lys Arg Arg Asp 

325 330 335 

Ser Glu Arg Phe Leu Leu Leu Asp Gly Gly Asn Ser Arg Leu Lys Trp 

340 345 350 

Ala Trp Val Glu Asn Gly Thr Phe Ala Thr Val Gly Ser Ala Pro Tyr 
355 360 365 

Arg Asp Leu Ser Pro Leu Gly Ala Glu Trp Ala Glu Lys Val Asp Gly 
370 375 380 

Asn Val Arg He Val Gly Cys Ala Val Cys Gly Glu Phe Lys Lys Ala 
385 390 395 400 

Gin Val Gin Glu Gin Leu Ala Arg Lys He Glu Trp Leu Pro Ser Ser 

405 410 415 

Ala Gin Ala Leu Gly He Arg Asn His Tyr Arg His Pro Glu Glu His 

420 425 430 

Gly Ser Asp Arg Trp Phe Asn Ala Leu Gly Ser Arg Arg Phe Ser Arg 
435 440 445 

Asn Ala Cys Val Val Val Ser Cys Gly Thr Ala Val Thr Val Asp Ala 
450 455 460 

Leu Thr Asp Asp Gly His Tyr Leu Gly Gly Thr He Met Pro Gly Phe 
465 470 475 480 

His Leu Met Lys Glu Ser Leu Ala Val Arg Thr Ala Asn Leu Asn Arg 

485 490 495 

His Ala Gly Lys Arg Tyr Pro Phe Pro Thr Thr Thr Gly Asn Ala Val 

500 505 510 

Ala Ser Gly Met Met Asp Ala Val Cys Gly Ser Val Met Met Met His 
515 520 525 

Gly Arg Leu Lys Glu Lys Thr Gly Ala Gly Lys Pro Val Asp Val He 
530 535 540 

He Thr Gly Gly Gly Ala Ala Lys Val Ala Glu Ala Leu Pro Pro Ala 
545 550 555 560 
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Phe Leu Ala Glu Asn Thr Val Arg Val Ala Asp Asn Leu Val lie His 

565 570 575 

Gly Leu Leu Asn Leu lie Ala Ala Glu Gly Gly Glu Ser Glu His Thr 

580 585 590 



<210> 23 
<211> 753 
<212> DNA 

<213> Clostridium acetobutylicum 
<400> 23 

aataagagag cagcttttat gctgctctta tttttaagga gtgtattaaa agtgatttta 60 
gttttagatg ttggcaatac taatatagtg ttaggaatat acaatgatac gaaacttaca 120 
gctgaatgga gactatcaac agatgtatta agatctgctg acgaatatgg aattcaagta 180 
atgaacttat ttcaacaaga taagctcgat ccaacattag ttgagggagt aataatatcc 240 
tctgttgtac ctaatatcat gtattcttta gaacatatga taagaaagta ctttaagata 300 
aatccattag ttgttggacc tggaataaaa acaggaatta atattaaata cgataatcct 360 
aaagaagttg gagccgacag aattgtaaat gctgtagcag cacatgaaat ttataaaaga 420 
tctcttataa taatagattt tggaacagca actacatttt gtgcagtaag agaaaatgga 480 
gattatcttg gtggagcaat atgccctgga attaaagttt catcagaggc tctttttgaa 540 
aaggcagcta agcttccaag agtagagctc ataaaaccag cgtatgctat ttgtaaaaat 600 
actatttcaa gtatacaatc tggaattgtt tatcgatacc tacgtcaggt aaaatactta 660 
tttgaaaaat tgaaagaaaa cctgccggac ggaaggagaa caaggacctc cttggtattg 720 
gccacaggtg gtcttgccaa acttattaat tga ' 753 

<210> 24 
<211> 798 
<212> DNA 

<213> Streptomyces coelicolor 
<400> 24 

atgctgctga cgatcgacgt agggaacacg cacaccgtcc tcggcctctt cgacggcgag 60 

gacatcgtcg agcactggcg catctccacg gactcgcgcc gcacggccga cgaactggcg 120 

gtgctcctcc agggcctcat gggcatgcat cccctcctcg gcgacgaact gggcgacggc 180 

atcgacggca tcgccatctg cgcgacggtc ccctccgtcc tccacgaact gcgcgaggtc 240 

acccgccgct actacggcga cgtccccgcg gtcctcgtcg aaccgggcgt caagaccggc 300 

gtcccgatcc tcaccgacca ccccaaggag gtcggcgccg accgcatcat caacgcggta 360 

gcggccgtgg agctctacgg cggcccggcg atcgtcgtgg acttcggcac ggcgacgacg 420 

ttcgacgcgg tcagcgcgcg cggggagtac atcggcggcg tcatcgcccc cggcatcgag 480 

atctcggtcg aggcgctggg cgtcaagggc gcccagctcc gcaagatcga ggtggcgcgc 540 

ccccgcagcg tgatcggcaa gaacacggtc gaggcgatgc agtccggcat cgtgtacggc 600 

ttcgccggcc aggtcgacgg cgtcgtcaac cgcatggcgc gggagctggc cgacgacccg 660 

gacgacgtga cggtcatcgc gacgggcggg ctggcgccga tggtcctggg cgagtcctcg 720 

gtcatcgacg agcacgagcc gtggctgacg ctgatgggtc tgcgcctggt gtacgagcgc 780 

aacgtgtcgc gcatgtag 798 

<210> 25 
<211> 819 
<212> DNA 

<213> Mycobacterium tuberculosis 
<400> 25 

gtgctgctgg cgattgacgt ccgcaacacc cacaccgttg tgggcctgct gtccggaatg 60 
aaagagcacg caaaggtcgt gcagcagtgg cggatacgca ccgaatccga agtcaccgcc 120 
gacgaactgg cactgacgat cgacgggctg atcggcgagg attccgagcg gctcaccggt 180 
accgccgcct tgtccacggt cccgtccgtg ctgcacgagg tgcggataat gctcgaccag 240 
tactggccgt cggtgccgca cgtgctgatc gagcccggag tacgcaccgg gatccctttg 300 
ctcgtcgaca acccgaagga agtgggcgca gaccgcatcg tgaactgttt ggccgcctat 360 
gaccggttcc ggaaggccgc catcgtcgtt gactttggat cctcgatctg tgttgatgtt 420 
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gtatcggcca agggtgaatt tcttggcggc gccatcgcgc ccggggtgca ggtgtcttcc 480 
gatgccgcgg cggcccgctc ggcggcattg cgccgcgttg aacttgcccg cccacgttcg 540 
gtggttggca agaacaccgt cgaatgcatg caagccggtg cggtgttcgg cttcgccggg 600 
ctggtagacg ggttggtagg ccgcatccgc gaggacgtgt ccggtttctc cgtcgaccac 660 
gatgtcgcga tcgtggctac cgggcatacc gcgcccctgc tgctgccgga attgcacacc 720 
gtcgaccatt acgaccagca cctgaccttg cagggtctgc ggctggtgtt cgagcgtaac 780 
ctcgaagtcc agcgcggccg gctcaagacg gcgcgctga 819 



600 
660 



<210> 26 
<211> 777 
<212> DNA 

<213> Rhodobacter capsulatus 
<400> 26 

atgcttttgt gcatcgactg cggcaacacc aacaccgtgt tttcggtctg ggacgggacg 60 
gatttcgccg ccacctggcg catcgccacc gatcatcgcc gcaccgccga cgaatatttc 120 
gtctggctga acacgctgat gcaactgaag ggcctgcagg gccggatctc cgaggcgatc 180 
atctcctcga ccgcgccgcg ggtggtgttc aacctgcgcg ttctgtgcaa ccgctatttc 240 
gactgccgcc cctatgtcgt cggcaaaccg ggctgcgagc tgccggtggc gccgcgcgtc 300 
gatccgggca ccacggtcgg gccggaccgg ctggtcaata cggtggcggg ctatgaccgt 360 
catggcggcg atctgatcgt cgtcgatttc ggcaccgcca ccacctttga cgtggtggcc 420 
cccgatggcg cctatatcgg cggggtgatc gcgcccgggg tgaacctgag ccttgaggcg 480 
ctgcatatgg cggcggccgc gctgccgcat gtcgacgtca cgaaaccgca aggggtgatc 54 0 
ggcacgaata cggtggcctg catccaatcc ggggtgtatt ggggctatat cggccttgtc 
gaaggcatcg tgcggcagat ccggatggaa cgtgaccgtc cgatgaaggt gattgccacc 
gggggtcttg cctcgctctt cgatctgggt ttcgatctgt tcgacaaggt cgaggatgac 720 
ctgaccatgc atggtctgcg tctgatcttc gattacaaca agggacttgg ggcgtga 777 

<210> 27 
<211> 768 
<212> DNA 

<213> Geobacter sulf urreducens 
<400> 27 

gtgcttcttg ttatagacgt gggtaatacc aatatcgtgc tcgggattta cgatggcgag 60 

cgcctggtga gggattggcg ggtctcca'cg gacaaggccc gtactaccga cgagtacggt 120 

attctcataa atgagttgtt ccgcttggcg ggccttgggc tcgatcagat ccgcgcggtg 180 

atcatctcct cggtggtgcc gcccctcacc ggcgtgctgg agcgtctttc cctggggtat 240 

ttcgggatgc gtcccctggt ggtgggaccg ggcatcaaga caggcatgcc aatccagtac 300 

gacaaccccc gggaggtggg ggccgaccgg atcgtgaacg cggtggcggg gtacgagaag 360 

taccgcacct ctctcattat cgtcgatttc ggcaccgcta ccacgttcga ctacgtgaac 420 

cgcaagggag agtactgcgg aggggccatc gcgccgggac tcgtcatttc caccgaggcc 480 

ctgttccagc gggccagcaa gctgcccagg gttgatatca tacgtccgtc cgcgatcatt 540 

gccaggaaca cggtcaattc gatgcaggcg ggaatttact atggttacgt ggggctcgta 600 

gacgagatcg tcacccggat gaaggccgag agcaaggatg cgccccgggt tatcgctacc 660 

ggagggttgg cgtccctcat agcgccggag tccaagacca tcgaagccgt cgaggaatat 720 

ctgacactgg agggattgcg catactgtac gaacgaaaca gggagtga 768 



<210> 28 
<211> 789 
<212> DNA 

<213> Deinococcus radiodurans 



<400> 28 

gtgcccgctt ttcccctgct cgccgtggac 
gccgacgcct cgggcgccct gacccacacc 
cccgacgacc tcgcgctgca actgcacggg 
cgcgccgccg tgctgagcag cgtggcgccc 
aagcggcact tcatgatcga cgcttttgcc 
gtggaactcg acacgccggg ctcggtgggt 
gaaaagtacc tgggggggct ggactacgcg 



atcggcaaca ccaccaccgt cctgggtctg 60 
tggcggattc ggaccaaccg cgagatgctg 120 
ctctttaccc tcgccggggc gccgattccc 180 
ccggtgggcg aaaactacgc gctcgcgctc 24 0 
gtgagtgccg agaacctgcc cgacgtgacg 300 
gcggaccgcc tgtgcaacct cttcggcgcc 360 
gtggtagtgg atttcgggac ctccaccaac 420 
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tttgacgtgg 
gtcagcgccg 
cccgagacgg 
tacgccgaga 
gtcgccgtcg 
tactacgacg 
gtccgctga 



tggggcgggg 
acgccctgtt 
ccatcggcaa 
tggtggacgg 
ccactggcgg 
aaacgctgac 



gcggcgtttc 
cgcccgcgcc 
aaacaccgtc 
cctgctgcgc 
cttctcgcgc 
gttgcgcggg 



ctcggcggca 
gccaaactgc 
cacgcgctgc 
cgcatccgcg 
accgtgcagg 
ttggtggagc 



tcctcgccac 
cgcgcatcac 
aatcgggcct 
ccgagttgcc 
ggatttgcca 
tgtgggcgag 



cggagcgcag 
cctgcaagcg 
ggtcttcggc 
gggcgaagcg 
ggaaatcgac 
ccgttcggag 



480 
540 
600 
660 
720 
780 
789 



<210> 29 
<211> 741 
<212> DNA 

<213> Thermotoga maritima 



<400> 29 

ttgtacctcc 

aaaactttca 

tcacacct tc 

tccgtcgttc 

tcaccgat at 

gaagtgggtg 

ggaatcatca 

gaaggaggag 

gcaaaacttc 

gaaaacatca 

cgaataaagg 

gtgaaagata 

catttctgct 



tcgtggacgt 
gaaggtggag 
atcctcttct 
ccactcagaa 
gggtgaaggc 
ctgatagggt 
tcgacatggg 
ccattttgcc 
cgctcgttga 
ggctgggtgt 
aagtttacgg 
tgataaaaca 
tcggagattg 



gggt aacacg 
gctgtccacc 
gggcgatgct 
cacagtcata 
gaaaaacgga 
ggccaacgtt 
aacggcaacc 
tggattcttc 
ggtaaaacca 
ggtgaacgga 
tgatttaccg 
cgagattttc 
a 



cattctgtct 
ggtgtgtttc 
atgcgtgaga 
gagcgttttt 
tgtgtgaaat 
gtcgctttcg 
accgtggatc 
atgatggttc 
gcggattttg 
agtgtctacg 
gtggttctca 
gatgaggacc 



tctctatcac 
agacggaaga 
taaaggggat 
ctcaaaagta 
ggaacgtgaa 
tcaaggaata 
ttgttgtgaa 
actcgctctt 
ttgt aggaaa 
ctcttgaggg 
cgggaggtca 
tcacgatcaa 



cgaagatggt 
cgaactcttt 
aggagtggcc 
tttccacata 
gaatccctcg 
cggtaaaaac 
cggatcttac 
tcggggaacg 
ggatacggag 
gataataggg 
gt cgaagatc 
gggggtgtac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

741 



<210> 30 
<211> 822 
<212> DNA 

<213> Treponema pallidum 



<400> 30 

atgcttttga 

ggccgtgtgt 

gagtactcgc 

cgtgatgcgt 

gctcagatta 

gtgcgcatac 

gcggcctatg 

tttacggcgg 

actgcggtgc 

cctgattccg 

acgctctttg 

gcagcggtga 

atcgatgcac 

ctcctgccac 



tagacgtagg 
gcgtgcgtga 
ttctcatcca 
ttatttcctc 
gcggcgtcca 
cagagccagt 
tgcatttccg 
tggatggcac 
agtctctcca 
ttctgggcaa 
ttattcgcgc 
taacgggggg 
agctgacgct 
ctgctacagt 



gaactcgcac 
gttgtttcgc 
tgcgct ttgc 
cgtcgtgcct 
gccggttgt c 
gcgcgcggaa 
ttctgcttgc 
ggggttgat t 
tacaggaacg 
ggatactacg 
tatgattgca 
gctttcgcgt 
ctcaggtctt 
gtcaggttca 



gtagtgttcg 
ct tgcgcctg 
gaacgtgcgg 
gtgttgacaa 
tttggcccgt 
attggcactg 
gtggtagtgg 
caaggggtgg 
gcacaattac 
catgcggtgc 
cagtgtcaga 
cttttctcgt 
gcacatattg 
tcggggaatt 



gaatccaagg 
acgcgcgtaa 
gggtcggccg 
agaccattgc 
gggcgtacga 
acttggtagc 
attgtggaac 
caattgcgcc 
cacttgttcc 
aggcgggtgt 
aagagttagg 
cagaggtgga 
cgcggctggt 
ga 



cgagaatggt 
aacccaagat 
tgcttctctc 
agatgcggtc 
gcacttgccg 
caacgcggtg 
agcgctcacc 
t'ggtctgcgc 
tcttgccctg 
ggtgcggggc 
gtgccgctgt 
ctttcct cct 
gccgacatct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

822 



<210> 31 
<211> 789 
<212> DNA 

<213> Borrelia burgdorferi 

gaaataccag cattgctttt 60 
tgaaaacaaa tcttatgtta 120 
attttaatgt aaataaagtt 180 
aaaatgtcat tttttctttt 240 
atgatttgac atttaatcct 300 



<400> 31 
atgaataaac 
gccttattta 
aggtatgatg 
tttataagca 
tttaagataa 



ctttattatc 
aagataatca 
aggtttatag 
gcgttgttcc 
agcctttgtt 



agaattgata 
agttaattta 
cttttttgaa 
tattcttaat 
tattggtttt 



attgatattg 
tttattaaaa 
gaaaattttg 
gaaacat tta 
gatttgaatt 
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tacaaaagcg 
gaaaattatt 
gctgttagca 
tttaattctt 
aatcttttag 
tatttaatag 
ataattactg 
aatattcatt 
gttaattga 



ataaattttt 
catttgaaaa 
ggcaagatgg 
tattagataa 
agagaacgac 
aaggtgttta 
ggggtaatgc 
taactgtaga 



gctaggttca 
tgttttagta 
aatactcggt 
tgcctatctt 
atctgggagt 
tcgtgatatt 
ggacctaatt 
aggcgttaga 



gacgtttttg 
gtagaccttg 
ggtattataa 
atcaaaaaat 
gtaaacagcg 
aagcagatgt 
ttgtcattaa 
attttaggaa 



ccaatcttgt 
gaactgcttg 
attctggtcc 
tccccattag 
gtttatttta 
ataaaaaaaa 
ttgagataga 
attctattga 



tgcagccatt 
caccattttt 
tttgataaat 
cactccaaat 
tcaatataag 
atttaattta 
gtttattttt 
ctttaagttt 



<210> 32 
<211> 690 
<212> DNA 

<213> Aquifex aeolicus 



<400> 32 

atgaggtttt 

aagaaagtaa 

aaattaaaag 

ataccgaaga 

cctgaaacgc 

aagaatgttg 

aaatttaagg 

ctggcggagg 

acacgagagt 

ctgaaactct 

gggaagtact 

aatttacttt 



tgacggtaga 
aagattttct 
cgctcggaat 
taaagttttt 
tgggcacgga 
tagtaatcag 
gagggtttat 
gaattcccga 
gcgtcctggg 
ggagaaaagt 
tttccaagtt 
acctctatca 



cgtagggaat 
gaaactttca 
atccgtaaaa 
aaagaaggaa 
cagggtagca 
tgcgggtact 
taccttagga 
gttttttccc 
aggggcttac 
atttaaaaga 
cggtatttac 
caggatttaa 



tcctccgttg 
cacgaagaat 
cagagtttta 
aacttt ccta 
cttgcttact 
gcccttgtaa 
cttggaaaga 
gaagaggtag 
agggagagca 
aagttcaaag 
gacccactcc 



atatcgccct 
ttttaaagga 
gcgaaaaagt 
tacaggttga 
ccgccaaaaa 
ttgacctagt 
agttaaaaat 
aaatttttct 
cagaatttat 
tggtt at aac 
ttgttcacag 



atgggaaggg 
agaatttcct 
aaggggaaaa 
ttacaaaact 
gttttacgga 
tcttgagggc 
tctttccgac 
tgggcgttct 
taaaagtaca 
gggcggagag 
gggcatgaga 



360 
420 
480 
540 
600 
660 
720 
780 
789 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

690 



<210> 33 
<211> 774 
<212> DNA 

<213> Synechocystis sp . 



<400> 33 

gtggaaacat 

t taggcctaa 

ctgcaaacct 

aaagttcctc 

cagcctaaaa 

attgaccggg 

gttgatggag 

ggagcgatct 

ctaccgaagt 

cccagcgcca 

caggattggc 

atattacatg 

ttgatcttcc 



caaagccggg 
tgataggcaa 
gggttacaga 
tgatgttggc 
ttttgaccct 
ccctggctgg 
gcactgcttt 
tgcccggttt 
tagaaatgga 
tttttagtgg 
aaaagctttt 
gcttcctaaa 
tcggtatggc 



ttgtggttta 
ctcccgtctg 
ttacaacccc 
atcggtggta 
gaagaat ctt 
tttagggacg 
gaccattaca 
gggattgcag 
tcaattaacc 
tgttgtctat 
tcctggtgcc 
agagcattct 
ggccatacac 



gccctggata 
cactgggcat 
aagtcagctc 
ccggaacaaa 
cccctggtca 
gggctgacct 
gggtttgacc 
ttagcaaccc 
gagttgcctg 
ggcgtgttgg 
gccatggtta 
cctaatcttt 
cacggcgatc 



atgacaagca 
attgtagcgg 
agttgccggt 
ccgaagtttg 
atctttaccc 
acggctttcc 
aagataaaaa 
ttggcgatcg 
accgttgggc 
gggcattgca 
tcaccggggg 
cggtggcctg 
gccccatctg 



aaaaccttgg 60 
caatgctccc 120 
tttgttgggg 180 
gcgagtatat 240 
cagctttggc 300 
ctgtctagtg 360 
actggtgggg 420 
cctggcggcc 480 
tttagatacc 540 
gagttatctc 600 
agacggcaag 660 
ggatgacaat 720 
ttag 774 



<210> 34 
<211> 672 
<212> DNA 

<213> Helicobacter pylori 
<400> 34 

atgccagcta ggcaatcttt caaggattta aa 
acacgcatcc atttcgcgca aaactaccag ct 
cgtttgggta ttcaaaagga aattttttac at 
cttttaaatt gttaccctaa cgctaaaaat at 
tatatagggc ttgggataga ccggcaaatg gc 



agacttga ttttatgcga tataggcaac 60 
cttttcaa gcgctaaaga agatttaaag 120 
tagtgtga atgaagaaaa tgaaaaagct 180 
cgcagggt tttttcattt agaaaccgac 240 
atgtttag cggtggttaa tggggttata 300 
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gtggatgctg 
tgtattttgc 
gagcaacctt 
gtgaattacg 
aaaatctatc 
tgcaaggagc 
gaatgcaaat 



ggagcgcgat 
ccggtttagc 
tcaaagcctt 
gcatgatttt 
tttgtggggg 
gtttggtttt 
ga 



tacgattgat 
ccaatatgtc 
agattcttta 
gagtatcatc 
cgatgcgaaa 
tgacgggatg 



<210> 35 

<211> 747 

<212> DNA 

<213> Pseudomonas 



ttagtcaaag 
catgcgtata 
gaagttttac 
tcttgtatcc 
tatttgagcg 
gaaatcgctc 



agggcaagca 
aaaaaagcgc 
ccaaaaacac 
aacatttagc 
cgtttttacc 
ttaaaaaagc 



tttaggaggg 
gaaaatctta 
cagagacgct 
taaagatcaa 
tcattctgtt 
agggatacta 



360 
420 
480 
540 
600 
660 
672 



aeruginosa 



<400> 35 

atgattcttg 

gcgcggtcgg 

tcgcagcaag 

acctcgcaac 

tccggcaagc 

gaccgctggc 

gatctgggga 

tacatatgcc 

cgctacgacg 

gccgaggcgg 

gccatggcgt 

gaactggttc 

ctggcactgg 



agctcgactg 
tcgccggtgg 
cgctgccagt 
tggtcgcacg 
agttggcggg 
tggccctcgt 
ccgcggtcac 
cgggcatgac 
atgcagaggc 
ttgagcgggg 
gcgagctgct 
gcgacgaact 
cttgcccgat 



tggaaactcg 
ccttgcggag 
gcgagcctgt 
gttggagcag 
tgtgcgcaac 
cgcggctcat 
ctctgacctg 
cctgatgaga 
ccggcgggcg 
ttgtctgctc 
cggtccggat 
ggctggcgcc 
tgagtga 



<210> 36 
<211> 630 
<212> DNA 

<213> Campylobacter jejuni 



<400> 36 

atgttgctct 

actcttaata 

gtcaatgaac 

tattttttat 

actattgaag 

aattctatac 

agccatattt 

ccacaaaaaa 

aaagatgccg 

aattatttcg 

ataaaagaaa 



gtgatattgg 
tagatcagtt 
atctcaaaga 
ttgatacaat 
atggagttgt 
atcttggtgg 
caccacgatt 
ccatggatgc 
ctcaaaataa 
atcacgcaat 
atcccaattt 



gaatt caaat 
tttagaattt 
acatttaaaa 
ttatcaagga 
tgtagatgca 
ttttatcttg 
aaaaagtgaa 
tttaagttat 
aaagctttat 
ttatgat aaa 
actttattaa 



<210> 37 
<211> 1779 
<212> DNA 

<213> Neisseria meningitidis 



<400> 37 

atgacggttt 

caacacgtct 

tggcagcaga 

ctggtgcgcc 

ggttttcaga 

gcgcggattg 

ggcagggggc 

tt tggctggg 

gtggcgtgcc 



tgaagccttc 
cgcaactggc 
tgccggcgca 
cattggcggt 
cggcattgaa 
cgccggacaa 
ggcaggggcg 
tgtttgaccg 
ggcgcgcctt 



gcactggcgg 
gcgtatggcg 
catacgcggg 
tttcgatgcc 
gcacgagtgc 
ggcgcacaaa 
gaagtggtcg 
gccgcagtat 
gtcgcgtttg 



ctgat 
tccga 
cgcct 
ctgtt 
ggcta 
cacct 
gtcgc 
agcca 
cttgc 
atgct 
tgtga 
cggat 



caagt 
tgatg 
ggtga 
cccgg 
tctcg 
ggcta 
ggcgg 
gttgc 
cagtc 
caggg 
aatat 
catgc 



gctaatttcc 
aaaaatgaac 
aatcaaaaaa 
ttaggaatcg 
ggtagtgcta 
ccaggtattg 
tttaacactc 
ggtgttttta 
ttcactggtg 
cttttaatct 



gtgttggcgg 
gatatgaagc 
ctgttgcgcc 
gaaggtttgc 
gcgtccagca 
accatatgtg 
caccgtttgg 
gagttgggtt 
ggtttgaaaa 



ggcgggtcat 
ccctggtcga 
gcgttcgcag 
tttcggcgct 
attaccagcg 
agaaggcctg 
atggagtgca 
gcacccatac 
tccagccagg 
ggttcgttcg 
tcctgacggg 
cggacctggt 



tagatgataa 
aaaaaatttt 
att ttatcaa 
atcgcatagc 
ttacaattga 
caaattataa 
aagttagtct 
aaggaattta 
gagatgggca 
tt cgaggaat 



agcttgccga 
cgcagcagct 
aacacgacgg 
gcgagctggg 
acgacgagat 
tgacccacct 
gcgagtgtct 
cgctgtcgcc 
cgcaaatcaa 



cgagggggcg 
acagtt aacg 
cgagcaggaa 
ggttgcatca 
cctggggctc 
cctggtcatt 
tctggggggc 
ccgacgtatc 
gcaggccacg 
tgagcagtac 
tggggatgcc 
tttcgtaggg 



caaatatt tt 
ttatatcaat 
tcttgaacct 
agcttgttat 
tattatttct 
aaaaatttat 
tgatgcattc 
cctactgata 
atttttagca 
gaaaaagatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

747 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

630 



cggtttgccg 60 
caacggtttt 120 
ctattggcgg 180 
ggaaaggtcg 240 
actggaattg 300 
gcaaagtaag 360 
gatgttcagt 420 
tgttgcggca 480 
gtggccaaac 540 
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gatttggtcg tcggacgcga caaattgggc ggcattctga ttgaaacggt caggacgggc 600 
ggcaaaacgg ttgccgtggt cggtatcggc atcaatttcg tgctgcccaa ggaagtggaa 660 
aacgccgctt ccgtgcaatc gctgtttcag acggcatcgc ggcggggaaa tgccgatgcc 720 
gccgtgttgc tggaaacgct gttggcggaa cttgatgcgg tgttgttgca atatgcgcgg 780 
gacggatttg cgccttttgt ggcggaatat caggctgcca accgcgacca cggcaaggcg 840 
gtattgctgt tgcgcgacgg cgaaaccgtg ttcgaaggca cggttaaagg cgtggacgga 900 
caaggcgttc tgcacttgga aacggcagag ggcaaacaga cggtcgtcag cggcgaaatc 960 
agcctgcggt ccgacgacag gccggtttcc gtgccgaagc ggcgggattc ggaacgtttt 1020 
ctgctgttgg acggcggcaa cagccggctc aagtgggcgt gggtggaaaa cggcacgttc 1080 
gcaaccgtcg gtagcgcgcc gtaccgcgat ttgtcgcctt tgggcgcgga gtgggcggaa 1140 
aaggtggatg gaaatgtccg catcgtcggt tgcgccgtgt gcggagaatt caaaaaggca 1200 
caagtgcagg aacagctcgc ccgaaaaatc gagtggctgc cgtcttccgc acaggctttg 1260 
ggcatacgca accactaccg ccaccccgaa gaacacggtt ccgaccgctg gttcaacgcc 1320 
ttgggcagcc gccgcttcag ccgcaacgcc tgcgtcgtcg tcagttgcgg cacggcggta 1380 
acggttgacg cgctcaccga tgacggacat tatctcgggg gaaccatcat gcccggtttc 1440 
cacctgatga aagaatcgct cgccgtccga accgccaacc tcaaccggca cgccggtaag 1500 
cgttatcctt tcccgaccac aacgggcaat gccgtcgcca gcggcatgat ggatgcggtt 1560 
tgcggctcgg ttatgatgat gcacgggcgt ttgaaagaaa aaaccggggc gggcaagcct 1620 
gtcgatgtca tcattaccgg cggcggcgcg. gcaaaagttg ccgaagccct gccgcctgca 1680 
tttttggcgg aaaataccgt gcgcgtggcg gacaacctcg tcattcacgg gctgctgaac 1740 
ctgattgccg ccgaaggcgg ggaatcggaa catacttaa 1779 



<210> 38 
<211> 804 
<212> DNA 

<213> Bordetella pertussis 
<400> 38 

atgattatcc tcatcgactc cggcaacagc cgcctcaaag tcggctggtt tgacccggac 60 
gcgccgcagg cggcgcgcga gcccgccccc gtcgccttcg acaatctcga cctggacgcg 120 
ctgggccgct ggctggccac cctgcccagg cgcccgcaac gggcgctggg cgtgaacgtc 180 
gccgggcttg cccgcggcga agccattgcc gccacgctgc gcgcgggcgg ttgcgacatc 24C 
cggtggctgc gggcccagcc cctggccatg gggctgcgca acggctatcg caatcccgac 300 
caactgggcg ccgaccgctg ggcgtgcatg gtgggcgtgc tggcgcgcca gccgtccgtg 360 
cacccgccgc tgctggtggc cagtttcggc acggccacca cgctggacac catcgggccc 420 
gacaatgtct ttcccggcgg gctgatcctg cccggccccg ccatgatgcg cggcgcgctg 480 
gcctacggca ccgcccacct gcccctggcc gacggcctgg tggccgacta ccccatcgac 540 
acccatcagg ccatcgccag cggcatcgcc gccgcccagg ccggcgcgat cgtgcggcaa 600 
tggctggccg gccgccaacg ctacggccag gcgccggaga tctatgtcgc cggcggcggg 660 
tggcccgaag tgcggcagga agccgagcgc ctgctggcgg tcaccggcgc cgccttcggc 720 
gccacgccgc agcccactta cctcgacagc cccgtgctcg acggcctggc ggcgctcgcc 780 
gcgcaaggcg cgccaacggc ctga 804 



<210> 39 
<211> 460 
<212> PRT 
<213> Neisseria 

<400> 39 
Met Gly Glu Cys 
1 

Gin Tyr Glu Leu 

20 

Arg Ala Leu Gly 
35 

Asp Leu Val Val 



gonorrhoeae 

Leu Met Phe Ser 
5 

Gly Ser Leu Ser 

Cys Leu Gly Leu 

40 

Gly Arg Asp Lys 



Phe Gly Trp Ala 
10 

Pro Val Ala Ala 
25 

Glu Thr Gin lie 



Leu Gly Gly lie 



Phe Asp Arg Pro 
15 

Leu Ala Cys Arg 
30 

Lys Trp Pro Asn 
45 

Leu lie Glu Thr 
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50 55 60 

Val Arg Ala Gly Gly Lys Thr Val Ala Val Val Gly lie Gly lie Asn 
65 70 75 80 

Phe Val Leu Pro Lys Glu Val Glu Asn Ala Ala Ser Val Gin Ser Leu 

85 90 95 

Phe Gin Thr Ala Ser Arg Arg Gly Asn Ala Asp Ala Ala Val Leu Leu 

100 105 110 

Glu Thr Leu Leu Ala Glu Leu Gly Ala Val Leu Glu Gin Tyr Ala Glu 
115 120 125 

Glu Gly Phe Ala Pro Phe Leu Asn Glu Tyr Glu Thr Ala Asn Arg Asp 
130 135 140 

His Gly Lys Ala Val Leu Leu Leu Arg Asp Gly Glu Thr Val Cys Glu 
145 150 155 160 

Gly Thr Val Lys Gly Val Asp Gly Arg Gly Val Leu His Leu Glu Thr 

165 170 175 

Ala Glu Gly Glu Gin Thr Val Val Ser Gly Glu lie Ser Leu Arg Pro 

180 185 190 

Asp Asn Arg Ser Val Ser Val Pro Lys Arg Pro Asp Ser Glu Arg Phe 
195 200 205 

Leu Leu Leu Glu Gly Gly Asn Ser Arg Leu Lys Trp Ala Trp Val Glu 
210 215 220 

Asn Gly Thr Phe Ala Thr Val Gly Ser Ala Pro Tyr Arg Asp Leu Ser 
225 230 235 240 

Pro Leu Gly Ala Glu Trp Ala Glu Lys Ala Asp Gly Asn Val Arg lie 

245 250 255 

Val Gly Cys Ala Val Cys Gly Glu Ser Lys Lys Ala Gin Val Lys Glu 

260 265 270 

Gin Leu Ala Arg Lys lie. Glu Trp Leu Pro Ser Ser Ala Gin Ala Leu 
275 280 285 

Gly lie Arg Asn His Tyr Arg His Pro Glu Glu His Gly Ser Asp Arg 
290 295 300 

Trp Phe Asn Ala Leu Gly Ser Arg Arg Phe Ser Arg Asn Ala Cys Val 
305 310 315 320 

Val Val Ser Cys Gly Thr Ala Val Thr Val Asp Ala Leu Thr Asp Asp 

325 330 335 

Gly His Tyr Leu Gly Gly Thr lie Met Pro Gly Phe His Leu Met Lys 

340 345 350 

Glu Ser Leu Ala Val Arg Thr Ala Asn Leu Asn Arg Pro Ala Gly Lys 
355 360 365 



Arg Tyr Pro Phe Pro Thr Thr Thr Gly Asn Ala Val Ala Ser Gly Met 
370 375 380 
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Met Asp Ala Val Cys Gly Ser lie Met Met Met His Gly Arg Leu Lys 
385 390 395 400 

Glu Lys Asn Gly Ala Gly Lys Pro Val Asp Val lie lie Thr Gly Gly 

405 410 415 

Gly Ala Ala Lys Val Ala Glu Ala Leu Pro Pro Ala Phe Leu Ala Glu 

420 425 430 

Asn Thr Val Arg Val Ala Asp Asn Leu Val lie His Gly Leu Leu Asn 
435 440 445 

Leu lie Ala Ala Glu Gly Gly Glu Ser Glu His Ala 
450 455 460 



<210> 40 
<211> 1383 
<212> DNA 

<213> Neisseria gonorrhoeae 



<400> 40 

ttgggcgagt 

ggttcgctgt 

gaaacgcaaa 

ctgattgaaa 

ttcgtgctgc 

tcgcggcggg 

gcggtgttgg 

gccaaccgcg 

ggcacggt ta 

cagacggtcg 

aagcggccgg 

gcgtgggtgg 

cctttgggcg 

gtgtgcggag 

ctgccgtctt 

ggttccgacc 

gtcgtcagtt 

ggcggaacca 

aacctcaacc 

gcaagcggca 

gaaaaaaacg 

gtcgccgaag 

ctcgtcatcc 

taa 



gcctgatgtt 
cgcctgttgc 
tcaagtggcc 
cagtcagggc 
ccaaggaagt 
gcaatgccga 
aacaatatgc 
accacggcaa 
aaggcgtgga 
tcagcggcga 
attcggaacg 
aaaacggcac 
cggagtgggc 
aatccaaaaa 
ccgcacaggc 
gttggttcaa 
gcggcacggc 
tcatgcccgg 
gccccgccgg 
tgatggacgc 
gcgcgggcaa 
ccctgccgcc 
acgggctgct 



cagtttcgga 
ggcacttgcg 
aaacgatttg 
gggcggtaaa 
ggaaaacgcc 
tgccgccgta 
ggaagaaggg 
ggcggtattg 
cggacgaggc 
aatcagcctg 
ttttttgctg 
gttcgcaacc 
ggaaaaggcg 
ggcacaagtg 
tttgggcata 
cgccttgggc 
ggtaacggtt 
cttccacctg 
caaacgtt ac 
ggtttgcggc 
gcctgtcgat 
tgcatttttg 
gaacctgatt 



tgggcgtttg 
tgccggcgcg 
gtcgtcggac 
acggttgccg 
gcttccgtgc 
ttgctggaaa 
ttcgcgccat 
ctgttgcgcg 
gttctgcact 
cggcccgaca 
ttggaaggcg 
gtgggcagcg 
gatggaaatg 
aaggaacagc 
cgcaaccact 
agccgccgct 
gacgcgctca 
atgaaagaat 
cctttcccga 
tcgataatga 
gtcatcatta 
gcggaaaata 
gccgccgaag 



accgcccgca 
ctttggggtg 
gcgacaaatt 
tggtcggtat 
agtcgctgtt 
cattgcttgc 
ttttaaatga 
acggcgaaac 
tggaaacggc 
acaggtcggt 
ggaacagccg 
cgccgtaccg 
t ccgcatcgt 
tcgcccgaaa 
accgccaccc 
tcagccgcaa 
ccgatgacgg 
cgctcgccgt 
ccacaacggg 
tgatgcacgg 
ccggcggcgg 
ccgtgcgcgt 
gcggggaatc 



gtatgagttg 60 
tttgggtttg 120 
gggcggcatt 180 
cggcatcaat 240 
tcagacggca 300 
ggaactgggc 360 
gtatgaaacg 420 
cgtgtgcgaa 480 
agaaggcgaa 540 
ttccgtgccg 600 
gctcaagtgg 660 
cgatttgtcg 720 
cggttgcgcc 780 
aatcgagtgg 840 
cgaagaacac 900 
cgcctgcgtc 960 
acattatctc 1020 
ccgaaccgcc 1080 
caacgccgtc 3.140 
ccgtttgaaa 1200 
cgcggcgaaa 1260 
ggcggacaac 1320 
ggaacacgct 1380 

1383 



<210> 41 
<211> 244 
<212> PRT 

<213> Porphyromonas gingivalis 
<400> 41 

Met Ser Phe Asn Leu lie Val Asp Gin Gly Asn Ser Ala Cys Lys Val 
15 10 15 



Ala Phe Val Arg Asn Asn Ser lie Glu Ser lie Ser Phe Leu Pro Gly 

20 25 30 



Lys Ala Gly Gin Ala 



Leu Ser His Leu Val Ala 



Pro His Arg Phe Asp 
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: a 3 



35 40 45 

Lys Ala lie Tyr Ser Ser Val Gly Leu Pro Asp Glu Glu Ala Glu Ala 
50 55 60 

lie Val Arg Ser Cys Ala Ala Ala Ser Leu Met Met Gly Thr Glu Thr 
65 70 75 80 

Pro Val Pro Leu Arg Leu Gin Tyr Asp Arg Arg Thr Leu Gly Ala Asp 

85 90 95 

Arg Leu Ala Ala Val Val Gly Ala His Ser Leu Tyr Pro Asn Thr Glu 

100 105 110 

Leu Leu Val lie Asp Ala Gly Thr Ala lie Thr Tyr Glu Arg Val Ser 
115 120 125 

Ala Glu Gly lie Tyr Leu Gly Gly Asn lie Ser Pro Gly Leu His Leu 
130 135 140 

Arg Phe Lys Ala Leu His Leu Phe Thr Gly Arg Leu Pro Leu lie Asp 
145 150 155 160 

Pro Ser Gly lie Ser Pro Lys lie Ala Glu Tyr Gly Ser Ser Thr Giu 

165 170 175 

Glu Ala lie Thr Ala Gly Val lie His Gly Leu Ala Gly Glu lie Asp 

180 185 190 

Arg Tyr lie Asp Asp Leu His Ala Lys Glu Gly Arg Ser Ala Val lie 
195 200 205 

Leu Thr Gly Gly Asp Ala Asn Tyr Leu Ala Arg lie lie Arg Ser Gly 
210 215 220 

lie Leu lie His Pro Asp Leu Val Leu Leu Gly Leu Asn Arg lie Leu 
225 230 235 240 

Glu Tvr Asn Val 



<210> 42 

<211> 735 ; 
<212> DNA 

<213> Porphyromonas gingivalis 
<400> 42 

atgtccttca atctgatcgt cgatcaaggc aattctgcct gtaaggttgc tttcgtccga 60 
aataatagta tagagagcat ttcctttctg ccgggaaaag ccggacaggc actcagccat 120 
ctcgtcgctc ctcaccgttt cgacaaggct atctactcat ctgtggggct tcccgacgaa 180 
gaggctgaag ctattgtgag aagttgtgca gctgcttcct tgatgatggg gactgagacc 240 
cccgtacccc ttcgcctgca atatgatcgc cgcactttgg gtgccgaccg actggctgcg 300 
gtggtcggag cgcatagtct ctatccgaat accgaattgc tggtgatcga cgccggtact 360 
gccatcactt atgaacgagt atccgctgaa gggatctatc tcggtggcaa tatatcgccc 420 
ggtctccact tgcgcttcaa ggctcttcat ctctttacgg gcaggctccc cctgattgat 480 
ccttccggta tctctccgaa aatagccgag tatggctcct cgaccgaaga agcgatcaca 540 
gccggagtaa ttcatggcct ggcaggggag atagacagat atattgacga tctgcacgct 600 
aaagaggggc ggtctgccgt tatactgacc ggaggagatg ccaactattt ggcacggatt 660 
ataagaagcg gaatactaat tcatcccgat ttagtacttt tgggcctaaa tagaatttta 720 
gaatataatg tataa 735 
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<210> 43 
<211> 592 
<212> PRT 

<213> Neisseria meningitidis 



<400> 43 

Met Thr Val Leu Lys Leu Ser His Trp Arg Val Leu Ala Glu Leu Ala 
15 10 15 

Asp Gly Leu Pro Gin His Val Ser Gin Leu Ala Arg Met Ala Asp Met 

20 25 30 

Lys Pro Gin Gin Leu Asn Gly Phe Trp Gin Gin Met Pro Ala His He 
35 40 45 

Arg Gly Leu Leu Arg Gin His Asp Gly Tyr Trp Arg Leu Val Arg Pro 
50 55 60 

Leu Ala Val Phe Asp Ala Glu Gly Leu Arg Glu Leu Gly Glu Arg Ser 
65 70 75 80 

Gly Phe Gin Thr Ala Leu Lys His Glu Cys Ala Ser Ser Asn Asp Glu 

85 90 95 

He Leu Glu Leu Ala Arg He Ala Pro Asp Lys Ala His Lys Thr He 

100 105 HO 

Cys Val Thr His Leu Gin Ser Lys Gly Arg Gly Arg Gin Gly Arg Lys 
115 120 ■ 125 

Trp Ser His Arg Leu Gly Glu Cys Leu Met Phe Ser Phe Gly Trp Val 
130 135 140 

Phe Asp Arg Pro Gin Tyr Glu Leu Gly Ser Leu Ser Pro Val Ala Ala 
145 150 155 160 

Val Ala Cvs Arg Arg Ala Leu Ser Arg Leu Gly Leu Asp Val Gin He 

165 170 175 

Lys Trp Pro Asn Asp Leu Val Val Gly Arg Asp Lys Leu Gly Gly He 

180 185 190 

Leu lie Glu Thr Val Arg Thr Gly Gly Lys Thr Val Ala Val Val Gly 
195 200 205 

lie Gly He Asn Phe Val Leu Pro Lys Glu Val Glu Asn Ala Ala Ser 
210 215 220 

Val Gin Ser Leu Phe Gin Thr Ala Ser Arg Arg Gly Asn Ala Asp Ala 
225 230 235 240 

Ala Val Leu Leu Glu Thr Leu Leu Val Glu Leu Asp Ala Val Leu Leu 

245 250 255 

Gin Tyr Ala Arg Asp Gly Phe Ala Pro Phe Val Ala Glu Tyr Gin Ala 

260 265 270 

Ala Asn Arg Asp His Gly Lys Ala Val Leu Leu Leu Arg Asp Gly Glu 
275 280 285 
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Thr Val Phe Glu 
290 

His Leu Glu Thr 
305 

Ser Leu Arg Ser 



Ser Glu Arg Phe 

340 

Ala Trp Val Glu 
355 

Arg Asp Leu Ser 
370 

Asn Val Arg lie 
385 

Gin Val Gin Glu 



Ala Gin Ala Leu 

420 

Gly Ser Asp Arg 
435 

Asn Ala Cys Val 
450 

Leu Thr Asp Asp 
465 

His Leu Met Lys 



His Ala Gly Lys 

500 

Ala Ser Gly Met 
515 

Gly Arg Leu Lys 
530 

lie Thr Gly Gly 
545 

Phe Leu Ala Glu 



Gly Leu Leu Asn 

580 



Gly Thr Val Lys 
295 

Ala Glu Gly Lys 
310 

Asp Asp Arg Pro 
325 

Leu Leu Leu Asp 



Asn Gly Thr Phe 

360 

Pro Leu Gly Ala 
375 

Val Gly Cys Ala 
390 

Gin Leu Ala Arg 
405 

Gly lie Arg Asn 



Trp Phe Asn Ala 

440 

Val Val Ser Cys 
455 

Gly His Tyr Leu 
470 

Glu Ser Leu Ala 
485 

Arg Tyr Pro Phe 



Met Asp Ala Val 

520 

Glu Lys Thr Gly 
535 

Gly Ala Ala Lys 
550 

Asn Thr Val Arg 
565 

Met lie Ala Ala 



Gly Val Asp Gly 

300 

Gin Thr Val Val 
315 

Val Ser Val Pro 
330 

Gly Gly Asn Ser 
345 

Ala Thr Val Gly 



Glu Trp Ala Glu 

380 

Val Cys Gly Glu 
395 

Lys lie Glu Trp 
410 

His Tyr Arg His 
425 

Leu Gly Ser Arg 



Gly Thr Ala Val 

460 

Gly Gly Thr lie 
475 

Val Arg Thr Ala 
490 

Pro Thr Thr Thr 
505 

Cys Gly Ser Val 



Ala Gly Lys Pro 

540 

Val Ala Glu Ala 
555 

Val Ala Asp Asn 
570 

Glu Gly Arg Glu 
585 



Gin Gly Val Leu 



Ser Gly Glu lie 

320 

Lys Arg Arg Asp 
335 

Arg Leu Lys Trp 
350 

Ser Ala Pro Tyr 
365 

Lys Ala Asp Gly 



Phe Lys Lys Ala 

400 

Leu Pro Ser Ser 
415 

Pro Glu Glu His 
430 

Arg Phe Ser Arg 
445 

Thr Val Asp Ala 



Met Pro Gly Phe 

480 

Asn Leu Asn Arg 
495 

Gly Asn Ala Val 
510 

Met Met Met His 
525 

Val Asp Val lie 



Leu Pro Pro Ala 

560 

Leu Val lie Tyr 
575 

Tyr Glu His lie 
590 



<210> 44 
<2'll> 1779 
<212> DNA 
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840 
900 
960 



<213> Neisseria meningitidis 
<400> 44 

atgacggttt tgaagctttc gcactggcgg gtgttggcgg agcttgccga cggtttgccg 60 
caacacgtct cgcaactggc gcgtatggcg gatatgaagc cgcagcagct caacggtttt 120 
tggcagcaga tgccggcgca catacgcggg ctgttgcgcc aacacgacgg ctattggcgg 180 
ctggtgcgcc cattggcggt tttcgatgcc gaaggtttgc gcgagctggg ggaaaggtcg 240 
ggttttcaga cggcattgaa gcacgagtgc gcgtccagca acgacgagat actggaattg 300 
gcgcggattg cgccggacaa ggcgcacaaa accatatgcg tgacccacct gcaaagtaag 360 
ggcagggggc ggcaggggcg gaagtggtcg caccgtttgg gcgagtgtct gatgttcagt 420 
tttggctggg tgtttgaccg gccgcagtat gagttgggtt cgctgtcgcc tgttgcggca 480 
gtggcgtgtc ggcgcgcctt gtcgcgttta ggtttggatg tgcagattaa gtggcccaat 540 
gatttggttg tcggacgcga caaattgggc ggcattctga ttgaaacggt caggacgggc 600 
ggcaaaacgg ttgccgtggt cggtatcggc atcaattttg tcctgcccaa ggaagtagaa 660 
aatgccgctt ccgtgcaatc gctgtttcag acggcatcgc ggcggggcaa tgccgatgcc 720 
gccgtgctgc tggaaacgct gttggtggaa ctggacgcgg tgttgttgca atatgcgcgg 780 
gacggatttg cgccttttgt ggcggaatat caggctgcca accgcgacca cggcaaggcg 
gtattgctgt tgcgcgacgg cgaaaccgtg ttcgaaggca cggttaaagg cgtggacgga 
caaggcgttt tgcacttgga aacggcagag ggcaaacaga cggtcgtcag cggcgaaatc 
agcctgcggt ccgacgacag gccggtttcc gtgccgaagc ggcgggattc ggaacgtttt 1020 
ctgctgttgg acggcggcaa cagccggctc aagtgggcgt gggtggaaaa cggcacgttc 1080 
gcaaccgtcg gtagcgcgcc gtaccgcgat ttgtcgcctt tgggcgcgga gtgggcggaa 1140 
aaggcggatg gaaatgtccg catcgtcggt tgcgctgtgt gcggagaatt caaaaaggca 1200 
caagtgcagg aacagctcgc ccgaaaaatc gagtggctgc cgtcttccgc acaggctttg 1260 
ggcatacgca accactaccg ccaccccgaa gaacacggtt ccgaccgctg gttcaacgcc 1320 
ttgggcagcc gccgcttcag ccgcaacgcc tgcgtcgtcg tcagttgcgg cacggcggta 1380 
acggttgacg cgctcaccga tgacggacat tatctcgggg gaaccatcat gcccggtttc 1440 
cacctgatga aagaatcgct cgccgtccga accgccaacc tcaaccggca cgccggtaag 1500 
cgttatcctt tcccgaccac aacgggcaat gccgtcgcca gcggcatgat ggatgcggtt 1560 
tgcggctcgg ttatgatgat gcacgggcgt ttgaaagaaa aaaccggggc gggcaagcct 1620 
gtcgatgtca tcattaccgg cggcggcgcg gcaaaagttg ccgaagccct gccgcctgca 1680 
tttttggcgg aaaataccgt gcgcgtggcg gacaacctcg tcatttacgg gttgttgaac 1740 
atgattgccg ccgaaggcag ggaatatgaa catatttaa 1779 

<210> 45 
<211> 262 
<212> PRT 

<213> Bacillus anthracis 
<400> 45 

Met lie Phe Val Leu Asp Val Gly Asn Thr Asn Ala Val Leu Gly Val 
15 10 15 

Phe Glu Glu Gly Glu Leu Arg Glh His Trp Arg Met Glu Thr Asp Arg 

20 25 30 

His Lys Thr Glu Asp Glu Tyr Gly Met Leu Val Lys Gin Leu Leu Glu 
35' 40 45 

His Glu Gly Leu Ser Phe Glu Asp Val Lys Gly lie lie Val Ser Ser 
50 55 60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Glu Lys Tyr 
65 70 75 . 80 

Phe Lys lie Lys Pro Leu Val Val Gly Pro Gly lie Lys Thr Gly Leu 

85 90 95 



Asn lie Lys Tyr Glu Asn Pro Arg Glu Val Gly Ala Asp Arg lie Val 

100 105 110 
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Asn Ala Val Ala Gly 
115 

Asp Phe Gly Thr Ala 
130 

Tyr Met Gly Gly Val 
145 

Leu Tyr Ser Arg Ala 

165 

Ser Ser Val Val Gly 

180 

Leu Tyr Gly Tyr Val 
195 

Glu Glu Ala Lys Gin 
210 

Lys Leu lie Ser Glu 
225 

Leu Thr Leu Lys Gly 

245 

Gin His Glu Lys Gly 

260 



lie His Leu Tyr Gly 
120 

Thr Thr Tyr Cys Tyr 
135 

lie Thr Pro Gly lie 
150 

Ala Lys Leu Pro Arg 

170 

Lys Asn Thr Val Ser 

185 

Gly Gin Val Glu Gly 
200 

Glu Pro Lys Val lie 
215 

Glu Ser Asn Val lie 
230 

Leu Tyr Met Leu Tyr 

250 

Glu 



Ser Pro Leu lie lie Val 
125 

lie Asn Glu Glu Lys His 
140 

Met lie Ser Ala Glu Ala 
155 160 

lie Glu lie Thr Lys Pro 

175 

Ala Met Gin Ser Gly lie 

190 

lie Val Lys Arg Met Lys 
205 

Ala Thr Gly Gly Leu Ala 
220 

Asp Val Val Asp Pro Phe 
235 240 

Glu Arg Asn Ala Asn Leu 

255 



<210> 46 
<211> 789 
<212> DNA 

<213> Bacillus anthracis 
<400> 46 

atgatttttg tattggatgt agggaacaca aatgctgtac tgggcgtgtt tgaagagggg 60 
gaacttcgtc aacattggcg catggaaaca gatcgtcata agacagaaga tgaatatgga 120 
atgcttgtga agcagttgct tgagcatgag ggtctttcgt ttgaagatgt gaaaggtatt 180 
atcgtatctt cagtcgtgcc accaattatg tttgctttag agcgcatgtg tgaaaagtat 240 
tttaaaatta agccgcttgt agtaggtcct ggaataaaaa cggggctaaa tattaaatat 300 
gaaaatccac gtgaagtagg tgcggatcga atcgtaaatg cagtagcagg gatccactta 360 
tatggaagtc cgcttattat tgtcgatttt ggtacggcta ctacatattg ttatattaac 420 
gaagaaaagc attatatggg tggagttatt acaccgggaa ttatgatttc agcagaggct 480 
ttatatagta gagccgcaaa acttcctcgt attgaaatta caaaaccaag cagtgtagtt 540 
gggaagaata cggtaagtgc gatgcaatct ggtattcttt atggttatgt tggacaagtg 600 
gaaggtattg ttaagcgcat gaaagaggaa gctaaacaag aaccgaaagt tattgcaaca 660 
ggtggattgg cgaaattaat ttcagaagaa tcgaatgtga ttgatgttgt agatccattt 720 
ttaacattaa aaggtttgta tatgttatac gagcggaatg caaatttaca gcatgagaaa 780 
ggtgaataa 1S9 

<210> 47 
<211> 254 
<212> PRT 

<213> Bacillus halodurans 
<400> 47 

Met lie Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
1 5 10 15 
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Tyr Gin Asp Glu Thr Leu Val His His Trp Arg Leu Ala Thr. Ser Arg 

. 20 25 30 

Gin Lys Thr Glu Asp Glu Tyr Ala Met Thr Val Arg Ser Leu Phe Asp 
35 40 45 

His Ala Gly Leu Gin Phe Gin Asp lie Asp Gly lie Val lie Ser Ser 
50 55 60 

Val Val Pro Pro Met Met Phe Ser Leu Glu Gin Met Cys Lys Lys Tyr 
65 70 75 80 

Phe His Val Thr Pro Met lie lie Gly Pro Gly lie Lys Thr Gly Leu 

85 90 95 

Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 

100 105 110 

Asn Ala Val Ala Ala lie Glu Leu Tyr Gly Tyr Pro Ala lie Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Leu lie Asn Glu Lys Lys Gin 
130 135 140 

Tyr Ala Gly Gly Val lie Ala Pro Gly lie Met lie Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr His Arg Ala Ser Lys Leu Pro Arg lie Glu lie Ala Lys Pro 

165 170 175 

Lys Gin Val Val Gly Thr Asn Thr lie Asp Ser Met Gin Ser Gly lie 

180 185 190 

Phe Tyr Gly Tyr Val Ser Gin Val Asp Gly Val Val Lys Arg Met Lys 
195 200 205 

Ala Gin Ala Glu Ser Glu Pro Lys Val lie Ala Thr Gly Gly Leu Ala 
210 215 220 

Lys Leu lie Gly -Thr Glu Ser Glu Thr lie Asp Val lie Asp Ser Phe 
225 230 235 240 

Leu Thr Leu Lys Gly Leu Gin Leu lie Tyr Lys Lys Asn Val 

245 250 



<210> 48 
<211> 765 
<212> DNA 

<213> Bacillus halodurans 
<400> 48 

atgatacttg tcattgatgt tggaaataca aatactgtgt taggggtcta ccaagatgaa 60 
acgttagtgc atcattggcg gctagcgacg agtaggcaaa agaccgagga tgagtatgca 120 
atgacggtgc gttctctctt tgatcatgca ggtctacagt ttcaagacat agacggcatt 180 
gtcatttcat ctgttgtccc accgatgatg ttttccttag agcaaatgtg caaaaaatac 240 
tttcatgtca ctcctatgat tattgggcct ggaattaaga caggcttaaa tattaagtat 300 
gacaatccaa aagaggttgg ggccgatcga atcgttaatg cagttgcagc gattgagtta 360 
tatggctacc ctgccattgt cgttgatttt ggaacagcaa caacatattg cttaattaat 420 
gaaaaaaaac aatatgcagg gggagtcatt gctcctggaa tcatgatctc aacagaagcg 480 
ttgtatcatc gcgcatcaaa attgccacgg attgaaatag cgaagccgaa acaagtcgta 540 
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gggacaaata cgattgattc gatgcaatca ggaatcttct acgggtatgt gagccaagtc 600 

gatggtgttg tgaaacgaat gaaggctcaa gcagaaagtg aaccgaaagt cattgcaact 660 

ggtgggcttg cgaagttaat cggaaccgag tcggaaacca ttgatgtaat cgattcgttt 720 

ttaacattaa aaggattgca actcatttat aagaagaatg tctga 765 

<210> 49 
<211> 258 
<212> PRT 

<213> Bacillus stearothermophilus 
<400> 49 

Met lie Phe Val Leu Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr Asp Gly Asp Glu Leu Lys His His Trp Arg lie Glu Thr Ser Arg 

20 25 30 

Ser Lys Thr Glu Asp Glu Tyr Gly Met Met lie Lys Ala Leu Leu Asn 
35 40 45 

His Val Gly Leu Gin Phe Ser Asp lie Arg Gly lie lie lie Ser Ser 
50 55 60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Leu Lys Tyr 
65 70 75 80 

Phe His lie Lys Pro Leu lie Val Gly Pro Gly lie Lys Thr Gly Leu 

85 90 95 

Asp lie Lys Tyr Asp Asn Pro Arg Glu Val Gly Ala Asp Arg lie Val 

100 105 * 110 

Asn Ala Val Ala Gly lie His Leu Tyr Gly Ser Pro Leu lie lie Val 
115 120 * 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asn Glu His Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala lie Ala Pro Gly lie Met lie Ser Thr Glu Ala 
145 150 155 160 

Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg lie Glu lie Ala Arg Pro 

165 170 175 

Asp Asp lie lie Gly Lys Asn Thr Val Ser Ala Met Gin Ala Gly lie 

180 185 190 

Leu Tyr Gly Tyr Val Gly Gin Val Glu Gly lie Val Ser Arg Met Lys 
195 200 205 

Ala Lys Ser Lys lie Pro Pro Lys Val lie Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu lie Ala Ser Glu Ser Asp lie lie Asp Val Val Asp Pro Phe 
225 230 235 240 

Leu Thr Leu Thr Gly Leu Lys Leu Leu Tyr Glu Lys Asn Thr Glu Lys 

245 250 255 



Lys Gly 
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<210> 50 
<211> 777 
<212> DNA 

<213> Bacillus stearothermophilus 



<400> 50 

atgatttttg tattggacgt cggcaataca aacacggtgt taggggtgta tgacggggac 60 
gaactgaaac atcattggcg cattgaaaca agccgctcga aaacggaaga cgaatacggc 120 
atgatgatca aagcgctctt gaaccatgtc ggcttgcagt tttccgacat tcgaggcatc 180 
atcatttcct cggtcgtgcc gccgattatg tttgctcttg aacgcatgtg tctaaaatat 240 
ttccatatca aaccgctcat cgtcggtccg ggcattaaaa ccgggctcga catcaaatat 300 
gacaatccgc gtgaggtggg cgccgaccgg attgtcaacg cggttgccgg catccatttg 360 
tacggcagtc cgctgattat cgtcgatttt ggcacggcga cgacgtattg ttatattaat 420 
gaacataaac aatatatggg aggggccatt gccccgggaa ttatgatctc gacagaggct 480 
ctgtttgcgc gggcggcgaa attgccgcgc attgaaatcg cccgcccgga tgatatcatc 540 
ggcaaaaata cggtcagcgc catgcaagcc ggtattttat acggttatgt cggacaagtg 600 
gaaggcatcg tgtcgcgaat gaaggcgaaa agcaaaatcc cgccgaaggt gattgctact 660 
ggcggtttgg ctccgctcat tgccagcgaa tcggacatca tcgatgtcgt tgatccgttt 720 
ttgacgctga ctggcttaaa attgttgtac gagaaaaaca ccgagaaaaa aggatga 777 



<210> 51 
<211> 260 
<212> PRT 

<213> Caulobacter crescentus 



<400> 51 
Met Leu Leu Ala 
1 

His Asp Gly Ala 

20 

Thr Arg Thr Ala 
35 

Met Gin Gly Leu 
50 

Val Val Pro Gin 
65 

Phe Asn Val Glu 



Asp Val Arg lie 

100 

Asn Ala lie Gly 
115 

Asp Ser Gly Thr 
130 

Phe Glu Gly Gly 
145 

Leu His Glu Ala 



lie Glu Gin Gly 
5 

Ser Trp Val Ala 



Asp Glu Tyr Val 

40 

Gly Phe Arg Ala 
55 

Ser lie Phe Asn 
70 

Pro Leu Val lie 
85 

Glu Lys Pro Ser 



Ala Ala Met Val 

120 

Ala Thr Thr Phe 
135 

lie lie Ala Pro 
150 

Ala Ala Lys Leu 
165 



Asn Thr Asn Thr 
10 

Gin Trp Arg Ser 
25 

Val Trp Leu Ser 



lie Asp Ala Val 

60 

Leu Arg Asn Leu 
75 

Gly Glu Asn Ala 
90 

Glu Ala Gly Ala 
105 

Tyr Pro Gly Pro 



Asp lie Val Ala 

140 

Gly lie Asn Leu 
155 

Pro Arg lie Ala 
170 



Met Phe Ala lie 
15 

Ala Thr Glu Ser 
30 

Gin Leu Leu Ser 
45 

lie lie Ser Ser 



Ser Arg Arg Tyr 

80 

Lys Leu Gly lie 
95 

Asp Arg Leu Val 
110 

Leu Val Val He 
125 

Ala Asp Gly Ala 



Ser Met Gin Ala 

160 

He Gin Arg Pro 
175 
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Ala Gly Asn Arg lie 

180 

Gly Val Phe Trp Gly 
195 

He Lys Ala Glu Arg 
210 

Val Ala Ser Leu Phe 
225 

Ser Asp Leu Thr He 

245 

He Ala Glu Ser 

260 



Val Gly Thr Asp Thr 

185 

Tyr He Ser Leu He 
200 

Gly Glu Pro Met Thr 
215 

Glu Gly Ala Thr Asp 
230 

Arg Gly Leu Leu Glu 

250 



Val Ser Ala Met Gin Ser 

190 

Glu Gly Leu Val Ala Arg 
205 

Val He Ala Thr Gly Gly 
220 

Ser He Asp His Phe Asp 
235 240 

He Tyr Arg Arg Asn Thr 

255 



<210> 52 
<211> 783 
<212> DNA 

<213> Caulobacter crescentus 
<400> 52 

atgctgctgg ccattgagca gggcaacacc aacaccatgt tcgccattca tgatggcgca 60 
tcgtgggtcg cgcagtggcg gtcagcgacc gaaagcacgc gcacggccga tgagtacgtc 120 
gtctggcttt cgcaactgct gtcgatgcag gggcttggct tccgggcgat cgacgccgtg 180 
atcatttcca gcgtcgtgcc gcagtcgatc ttcaatctgc gcaacctgag ccgccgctac 240 
ttcaacgtcg agcctctggt catcggtgag aacgccaagc tgggcattga tgtccgcatc 300 
gagaaaccct ccgaggccgg cgccgaccgc ctggtcaacg ccattggcgc ggcgatggtc 360 
tatccgggtc cgctggtcgt gatcgacagc ggcaccgcga cgacgttcga catcgtggcc 420 
gccgacggcg ccttcgaggg cgggattatc gcgcccggta tcaacctgtc gatgcaggct 480 
ctgcacgagg cggcggcgaa gctgccgcgc atcgccatcc agcgtcccgc cggtaacagg 540 
atcgtgggca cggacacggt ctccgccatg cagtccggcg tcttctgggg ctatatttcg 600 
ctgatcgaag gcctcgtcgc gcggatcaag gccgagcgcg gegagcctat gaccgttatc 660 
gccacgggtg gcgtcgcctc gctgttcgag ggcgcgaccg acagcattga ccacttcgac 720 
tctgatctga cgatccgggg tcttctcgaa atctaccgcc gaaacaccat cgccgagtcc 780 
tga 

<210> 53 
<211> 257 
<212> PRT 

<213> Chlorobium tepidum 
<400> 53 

Met Arg Leu Val Val Asp lie Gly Asn Thr Ser Thr Thr Leu Ala He 
1.5 10 15 

Phe Thr Gly Asp Glu Glu Pro Ser Val Glu Ser Val Pro Ser Ala Leu 

20 25 30 

Phe Ala Asp Ser Ser Thr Met Arg Glu Val Phe Gly Asn Met Ala Arg 
35 40 45 

Lys His Gly Glu Pro Gin Ala He Ala He Cys Ser Val Val Pro Ser 
50 55 60 



783 



Ala Thr Ala Val Gly Ser Ala Leu Leu Glu Ser Leu Phe Ser Val Pro 
65 70 75 80 
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Val Leu Thr lie 



Ala Thr Pro His 

100 

Ser Arg His Leu 
115 

Thr Ala lie Thr 
130 

Gly Leu lie Met 
145 

Arg Thr Ala Gin 



Leu Gly Arg Ser 

180 

Val Val Lys Gin 
195 

Val Arg Asp Phe 
210 

Asn Ser Arg lie 
225 

Glu Leu Ala Val 



Pro 



Cys Cys Lys Leu 
85 

Thr Phe Gly Ala 



Phe Ser Glu Lys 

120 

Phe Asp Val Leu 
135 

Pro Gly lie Asp 
150 

Leu Pro Gin Val 
165 

Thr Thr Glu Cys 



lie Gly Gly Leu 

200 

Gly Glu Ser Thr 
215 

lie Val Pro Glu 
230 

Leu Arg Gly Ser 
245 



Arg Phe Pro Phe 
90 

Asp Arg Leu Ala 
105 

Pro Val lie Ala 



Asp Thr Val Gly 

140 

Met Met Ala Gly 
155 

Arg lie Asp Arg 
170 

lie Lys Ser Gly 
185 

Val Asp Ala lie 



Val Glu Val lie 

220 

lie Gly Pro Val 
235 

Asp Leu Leu Leu 
250 



Arg Leu Asp Tyr 
95 

Leu Cys Ala Trp 
110 

Val Asp lie Gly 
125 

Asn Tyr Arg Gly 



Ala Leu His Ser 

160 

Pro Glu Ser Leu 
175 

Val Phe Trp Gly 
190 

Arg Gly Asp Leu 
205 

Val Thr Gly Gly 



Ser Val lie Asp 

240 

Arg Met Asn Met 



<210> 54 
<211> 774 
<212> DNA 

<213> Chlorobium tepidum 
<400> 54 

gtgcggctgg tcgttgacat cggcaatacc agcacgacgt tggcgatttt caccggtgat 60 
gaagagccgt cggtcgagtc ggtaccgagt gcgttgtttg ccgattccag cacaatgcgc 120 
gaagtgtttg gcaacatggc ccggaagcac ggcgagccac aggccatcgc catttgcagc 180 
gtggtgcctt ccgctaccgc cgtcggttcg gcgcttctcg aatcactttt ctccgtaccg 240 
gtgctgacca tctgctgtaa gctccgtttt ccttttcgtc tcgactacgc aaccccgcac 300 
accttcggcg cggatcgcct tgccctgtgc gcatggagcc gacatctctt ttctgaaaaa 360 
ccggttatcg ccgtcgatat cggcacggcc atcaccttcg acgtgctcga cacggtgggg 420 
aattatcgcg gtggtctcat catgccgggt atcgacatga tggccggagc gcttcattcg 4 80 
agaaccgccc agcttcccca ggtgcgcatc gacaggccgg agagccttct cgggcgctcg 540 
acgaccgaat gcatcaaaag cggagttttc tggggagtgg tcaaacagat cggcggcctc 600 
gtggacgcca ttcgcggcga ccttgtacgc gactttggcg agtcaacggt cgaagtgatt 660 
gtcaccggcg gcaatagcag gattatcgtt ccggagatcg gccctgtcag tgttatcgac 720 
gaactcgctg tcctgcgcgg cagcgatctt ttgctgcgga tgaatatgcc gtga 774 

<210> 55 
<211> 256 
<212> PRT 

<213> Clostridium difficile 
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<400> 55 

Met Leu Leu Val Phe Asp Val Gly Asn Thr Asn Met Val Leu Gly lie 
15 10 15 

Tyr Lys Gly Asp Lys Leu Val Asn Tyr Trp Arg lie Lys Thr Asp Arg 

20 25 30 

Glu Lys Thr Ser Asp Glu Tyr Gly lie Leu lie Ser Asn Leu Phe Asp 
35 40 45 

Tyr Asp Asn Val Asn lie Ser Asp lie Asp Asp Val lie lie Ser Ser 
50 55 60 

Val Val Pro Asn Val Met His Ser Leu Glu Asn Phe Cys lie Lys Tyr 
65 70 75 80 

Cys Lys Lys Gin Pro Leu lie Val Gly Pro Gly lie Lys Thr Gly Leu 

85 90 95 

Asn lie Lys Tyr Asp Asn Pro Lys Gin Val Gly Ala Asp Arg lie Val 

100 105 110 

Asn Ala Val Ala Gly lie Glu Lys Tyr Gly Ala Pro Ser lie Leu Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala lie Ser Glu Lys Gly Glu 
130 135 140 

Tyr Leu Gly Gly Thr lie Ala Pro Gly lie Lys lie Ser Ser Glu Ala 
145 150 155 160 

Leu Phe Gin Ser Ala Ser Lys Leu Pro Arg Val Glu Leu Ala Lys Pro 

165 170 175 

Gly Met Thr lie Cys Lys Ser Thr Val Ser Ala Met Gin Ser Gly lie 

180 185 190 

lie Tyr Gly Tyr Val Gly Leu Val Asp Lys lie lie Ser lie Met Lys 
195 200 205 

Lys Glu Leu Asn Cys Asp Asp Val Lys Val lie Ala Thr Gly Gly Leu 
210 215 220 

Ala Lys Leu lie Ala Ser Glu Thr Lys Ser lie Asp Tyr Val Asp Gly 
225 230 235 240 

Phe Leu Thr Leu Glu Gly Leu Arg lie lie Tyr Glu Lys Asn Gin Glu 

245 250 255 



<210> 56 
<211> 771 
<212> DNA 

<213> Clostridium difficile 
<400> 56 

atgcttctag tatttgatgt tggaaatact 
aaattagtta attactggag aattaaaaca 
atcctgataa gtaacctatt tgattatgat 
ataatatcat ctgtagttcc gaatgttatg 



aatatggttt taggtatata taaaggtgac 60 
gatagggaaa aaacgtctga tgaatatgga 120 
aatgtgaata taagtgatat tgatgatgtt 180 
cattctcttg aaaacttttg tataaagtac 240 
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tgtaaaaaac agccattaat agtaggtcca ggcataaaaa caggtctaaa tataaaatat 300 
gataatccaa aacaagttgg ggcagataga atagttaatg ctgtagcagg gatagaaaag 360 
tatggagcac caagtatact tgttgatttt ggaacagcaa ctacattttg tgctatctct 420 
gaaaaaggtg aatatttggg tggaacaata gcaccaggaa taaaaatatc tagtgaggcg 480 
ttatttcaaa gtgcgtctaa attacctaga gtagaattag ctaagccagg tatgactatt 540 
tgtaagagta ctgtatcagc catgcaatct ggaataattt atggatatgt tggtttagtt 600 
gacaaaataa taagtattat gaagaaagaa ttgaattgtg atgatgttaa ggttatagct 660 
acaggtggat tagctaaact gattgcttca gagacgaaaa gtatagatta tgtagatggt 720 
tttttaacac tagaaggatt gagaataata tatgaaaaaa accaagaata a 771 

<210> 57 
<211> 219 
<212> PRT 

<213> Dehalococcoides ethenogenes 
<400> 57 

Met Ser Glu Lys Leu Val Ala Val Asp lie Gly Asn Thr Ser Val Asn 
1 5 10 15 

lie Gly lie Phe Glu Gly Glu Lys Leu Leu Ala Asn Trp His Leu Gly 

20 25 30 

Ser Val Ala Gin Arg Met Ala Asp Glu Tyr Ala Ser Leu Leu Leu Gly 
35 40 45 

Leu Leu Gin His Ala Gly lie His Pro Glu Glu Leu Asn Arg Val lie 
50 55 60 

Met Cys Ser Val Val Pro Pro Leu Thr Thr Thr Phe Glu Glu Val Phe 
65 70 75 80 

Lys Ser Tyr Phe Lys Ala Ala Pro Leu Val Val Gly Ala Gly lie Lys 

85 90 95 

Ser Gly Val Lys Val* Arg Met Asp Asn Pro Arg Glu Val Gly Ala Asp • 

100 105 110 

Arg lie Val Asn Ala Ala Ala Ala Arg Val Leu Tyr Pro Gly Ala Cys 
115 120 125 

lie lie Val Asp Met Gly Thr Ala Thr Thr Phe Asp Thr Leu Ser Glu 
130 135 140 

Gly Gly Ala Tyr lie Gly Gly Ala lie Ala Pro Gly lie Ala Thr Ser 
145 150 155 160 

Ala Gin Ala lie Ala Glu Lys Thr Ser Lys Leu Pro Lys lie Glu lie 

165 170 175 

lie Arg Pro Ala Lys Val lie Gly Ser Asn Thr Val Ser Ala Met Gin 

180 185 190 

Ser Gly lie Tyr Phe Gly Tyr lie Gly Leu Val Glu Glu Leu Val Arg 
195 200 205 

Arg lie Gin Thr Glu Leu Gly Gin Lys Thr Arg 
210 215 



<210> 58 
<211> 659 



OGZ-001 



-42- 



<212> DNA 

<213> Dehalococcoides ethenogenes 
<400> 58 

atgtctgaaa aactggtggc ggtagatatc ggcaatacca gcgtaaatat aggtatattt 60 
gagggcgaaa aactgctggc aaactggcat ctgggttcgg ttgcccagcg tatggctgat 120 
gaatatgcca gtctgctctt aggcctgttg cagcacgccg gtatacaccc ggaagagcta 180 
aacagggtaa tcatgtgcag tgttgtgccg cccctgacca ctacttttga agaggtattt 240 
aaaagctatt tcaaggctgc tcctctggta gtgggtgcag gtataaagag cggggttaag 300 
gtgcgcatgg ataacccccg tgaggttggg gctgaccgca tagtaaatgc cgctgccgcc 360 
agggtgcttt atccgggggc gtgcataata gtggacatgg gtacggccac tacctttgat 420 
accctttccg agggtggggc atatataggc ggggcgattg cacccggtat tgccacctca 480 
gcccaggcta ttgcggaaaa gacttcaaaa ctgcccaaga ttgagataat ccgtcctgcc 540 
aaagttatcg gctctaatac tgtgtcggct atgcagtcag gtatatactt cggttatatc 600 
gggctggtgg aagagctggt caggcggatt caaactgaat tggggcagaa aaccagagt 659 

<210> 59 
<211> 212 
<212> PRT 

<213> Desulf ovibrio vulgaris 
<400> 59 

Met Thr Gin His Phe Leu Leu Phe Asp lie Gly Asn Thr Asn Val Lys 



5 10 15 



He Gly He Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro 

20 25 30 

Thr Asp Pro Gly Gin Thr Thr Asp Ser lie Gly Leu Arg Leu Leu Glu 
35 40 45 

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val Gly Ala Cys Val 
50 55 60 

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu He Arg Arg Ala Cys 
65 70 75 80 

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro Gly Asp lie Ala 

85 90 95 

He Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val Gly Ala Asp 

100 105 110 

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Leu Tyr Pro Gly Pro Arg 
115 120 125 

Ser Leu Val Ser Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Cys Val 
130 135 140 

Glu Gly Gly Ala Tyr Leu Gly Gly Leu He Cys Pro Gly Val Leu Ser 
145 150 155 160 

Ser Ala Gly Ala Leu Ser Ser Arg Thr Ala Lys Leu Pro Arg lie Ser 

165 170 175 

Leu Glu Val Glu Glu Asp Ser Pro Val He Gly Arg Ser Thr Thr Thr 

180 185 190 



Ser Leu Asn His Gly Phe He Phe Gly Phe Ala Ala Met Thr Glu Gly 
195 200 205 
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Val Leu Ala Ala 
210 



60 



<210> 60 
<211> 639 
<212> DNA 

<213> Desulf ovibrio vulgaris 
<400> 60 

atgacccagc atttcctgct gttcgacatc ggcaacacca acgtcaagat cggcatcgcg 
gtggaaaccg ccgtgctgac ttcgtacgtg ctgcccacag accccggcca gacgaccgac 120 
tccatcgggc tgcgcctgct ggaggtgctg cgccatgccg ggctgggacc ggcggacgtg 180 
ggggcctgcg tggccagttc ggtggtgccc ggcgtcaacc cgctgatccg ccgcgcctgc 240 
gaacgttacc tgtatcgcaa gctgctgttc gcccccggcg acatcgccat tccgctggac 300 
aaccgctacg aacggcccgc cgaagtgggc gcggaccggc tggtggcggc ctatgccgcc 360 
cggcggctgt accccggccc ccggtcgctg gtatccgtgg atttcggcac cgccaccacg 420 
tttgactgcg tggaaggggg tgcgtatctt ggtggtttga tctgtcccgg cgtgctgtcg 480 
tccgccgggg cgttgtcgtc gcgcacggcc aagctgccgc gcatcagtct ggaagtggaa 540 
gaggattcgc cggtcatcgg gcggtccacc accaccagcc tgaaccacgg cttcattttc 600 
ggctttgccg ccatgaccga aggggtgctg gccgcctga 639 

<210> 61 
<211> 249 
<212> PRT 

<213> Pseudomonas putida 
<400> 61 

Met lie Leu Glu Leu Asp Cys Gly Asn Ser Phe lie Lys Trp Arg Val 
15 10 15 

He His Val Ala Asp Ala Val He Glu Gly Gly Gly He Val Asp Ser 

20 2 5 30 

Asp Gin Ala Leu Val Ala Glu Val Ala Ala Leu Ala Ser Val Arg Leu 
35 40' 45 

Thr Gly Cys Arg He Val Ser Val Arg Ser Glu Glu Glu Thr Asp Ala 
50 55 60 

Leu Cys Ala Leu He Ala Gin Ala Phe Ala Val Gin Ala Lys Val Ala 
65 70 75 80 

His Pro Val Arg Glu Met Ala Gly Val Arg Asn Gly Tyr Asp Asp Tyr 

85 90 95 

Gin Arg Leu Gly Met Asp Arg Trp Leu Ala Ala Leu Gly Ala Phe His 

100 105 110 

Leu Ala Lys Gly Ala Cys Leu Val He Asp Leu Gly Thr Ala Ala Lys 
115 120 125 

Ala Asp Phe Val Ser Ala Asp Gly Glu His Leu Gly Gly Tyr He Cys 
130 135 140 

Pro Gly Met Pro Leu Met Arg Ser Gin Leu Arg Thr His Thr Arg Arg 
145 150 155 160 

He Arg Tyr Asp Asp Ala Ser Ala Glu Arg Ala Leu Ser Ser Leu Ser 

165 170 175 



Pro Gly Arg Ser Thr Val Glu Ala Val Glu Arg Gly Cys Val Leu Met 
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180 185 190 

Leu Gin Gly Phe Ala Tyr Thr Gin Leu Glu Gin Ala Arg Val Leu Trp 
195 200 205 

Gly Glu Glu Phe Thr Val Phe Leu Thr Gly Gly Asp Ala Pro Leu Val 
210 215 220 

Arg Ala Ala Leu Pro Gin Ala Arg Val Val Pro Asp Leu Val Phe Val 
225 230 235 240 

Gly Leu Ala Met Ala Cys Pro Leu Asp 

245 



<210> 62 
<211> 750 
<212> DNA 

<213> Pseudomonas putida 
<400> 62 

atgattcttg agctcgattg cggtaacagc ttcatcaagt ggcgggtgat ccatgttgcc 60 
gatgctgtga ttgaaggtgg tgggatcgtc gattccgatc aggcgctggt ggcggaagtg 120 
gctgcgctcg cttcagtgcg tctcacgggt tgccgtattg tcagtgtgcg cagcgaagaa 180 
gagaccgatg cgctttgcgc gttgattgct caggcatttg ccgtgcaggc gaaggttgcc 240 
caccctgtcc gtgaaatggc aggtgtgcgc aatggctatg acgactatca gcgcctgggt 300 
atggatcgtt ggctggcggc gttgggggca tttcacctgg ccaagggcgc gtgcctggtg 360 
attgacctgg gtaccgcggc aaaagcggac ttcgtttctg cagatggcga gcatcttggg 420 
ggctacatct gcccaggtat gccattgatg cgtagccagc tgcgcactca cacccgtcgg 480 
atccgctatg acgatgcctc cgcggagcgc gcattgagca gcttgtcacc aggtcgctcg 540 
actgtcgaag cggtagagcg cggttgcgta ttgatgctcc agggctttgc ctacacccag 600 
cttgagcagg ctcgtgtgct atggggtgag gagttcaccg tgttcctcac tggcggtgat 660 
gcgccactgg tgagggcggc cctgccacag gcgcgggtcg tgcctgacct ggttttcgtt 720 
ggcctggcaa tggcttgtcc attggattga 750 

<210> 63 
<211> 241 
<212> PRT 

<213> Thiobacillus ferrooxidans 
<400> 63 

Met lie Phe lie Ala Val Gly Asn Thr Arg Thr Leu Leu Ala His Thr 
15 10 15 

His Asp Gly Val His Phe Asp Ser Val Ser Val Ala Thr Ser Leu Pro 

20 25 30 

Pro Thr Glu lie Leu Gin Gin Pro Gly Leu Thr Trp Leu Ser Ala Pro 
35 40 45 

Asn Arg Glu Pro Val Ala Leu Gly Gly Val Val Pro Ala Ala Leu Ala 
50 55 60 

Ala Trp Arg Glu Ala Leu Ala Thr Ala Glu Val Arg Glu Pro Asp Pro 
65 70 75 80 

Gly Phe Phe Arg Arg Ala Val Pro His Asp Tyr His Pro Pro Glu Ser 

85 90 95 



Leu Gly Phe Asp Arg Arg Cys Cys Leu Leu Ala Ala Ala Met Asp Tyr 

100 105 HO 
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Pro Gly Gin Asp 
115 

Asp Leu Leu Ala 
130 

lie Ala Met Ser 
145 

Glu Val Val Leu 



Asn Ala lie Gin 

180 

Gly Ala lie Thr 
195 

lie Thr Gly Gly 
210 

Leu Tyr Gin Pro 
Gly 



Ser lie Val lie 

120 

Gly Gly His Phe 
135 

Leu Arg Gly Leu 
150 

Asn Ala Pro Ala 
165 

Ala Gly Val lie 



Asp Phe Arg Gin 

200 

Asp Ala Glu Arg 
215 

His Leu Leu Leu 
230 



Asp Met Gly Thr 



Arg Gly Gly Arg 

140 

His Glu Gly Thr 
155 

Glu Met Leu Gly 
170 

His Leu Phe Ala 
185 

Tyr Ser Pro Gin 



Trp Gin Pro Gly 

220 

Arg Gly Phe Tyr 
235 



Ala lie Thr lie 
125 

lie Leu Pro Gly 



Ala Leu Leu Pro 

160 

Asn Asp Thr Ser 
175 

Asp Ala Leu Arg 
190 

Ala Arg lie Leu 
205 

lie Ala Gly Ser 



Leu Trp lie Arg 

240 



<210> 64 
<211> 726 
<212> DNA 

<213> Thiobacillus f errooxidans 
<400> 64 

caatacccgc accctgctgg cacacaccca cgatggcgtg 60 
ggccacttcg ctgccaccca cggaaatcct gcagcagccc 120 
gccgaaccgg gaacccgtcg cgctgggcgg cgtcgtacct 180. 
ggaagccttg gccacggcag aggtccgcga acccgacccc 240 
gccgcacgac tatcatccgc cggaaagcct cggctttgac 300 
cgccgccatg gactaccccg gccaggacag catcgtcatc 360 
catcgacctg ctggctggcg gacatttccg gggcggacgc 420 
gagcctgcgc ggtctgcatg aaggcacggc actccttcct 480 
agcggaaatg ctgggcaatg acaccagcaa cgccattcag 540 
tgccgatgcc ctgcgcggcg ccattaccga ctttcgccag 600 
actgatcacc ggtggcgatg ccgaacgttg gcaacccggc 660 
gccccatctg cttctgcgcg gcttttatct gtggatacgg 720 
ggatga 126 

<210> 65 
<211> 242 
<212> PRT 

<213> Xylessa fastidiosa 
<400> 65 

Met Asn Asp Trp Leu Phe Asp Leu Gly Asn Ser Arg Phe Lys Cys Ala 
15 10 15 



atgatcttca 
catttcgaca 
ggcttgacat 
gcggcgcttg 
ggcttttttc 
cgccgttgct 
gacatgggca 
attctgccgg 
gaagtcgtcc 
gccggggtca 
tacagccccc 
atcgctggta 



tcgccgtcgg 
gcgtcagcgt 
ggctcagcgc 
ccgcctggcg 
gccgcgccgt 
gcctgctcgc 
ccgccatcac 
gtatcgccat 
tgaacgcccc 
tccacctctt 
aggcacggat 
gcctgtacca 



Ser Leu Arg Glu Gly Val lie Gly Pro Val Thr Val Leu Pro Tyr Leu 

20 25 30 
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Thr Glu Thr Met Asp Ala Phe Ala Leu Gin Glu Leu Pro Arg Gly Arg 
35 40 45 

Val Ala Tyr Leu Ala Ser Val Ala Ala Pro Ala lie Thr Thr His Val 
50 55 60 

Leu Glu Val Leu Lys lie His Phe Glu Gin Val Gin Val Ala Ala Thr 
65 70 75 80 

Val Ala Ala Cys Ala Gly Val Arg lie Ala Tyr Ala His Pro Glu Arg 

85 90 95 

Phe Gly Val Asp Arg Phe Leu Ala Leu Leu Gly Ser Tyr Gly Glu Gly 

100 105 110 

Asn Val Leu Val Val Gly Val Gly Thr Ala Leu Thr lie Asp Leu Leu 
115 120 125 

Ala Ala Asn Gly Cys His Leu Gly Gly Arg lie Ser Ala Ser Pro Thr 
130 135 140 

Leu Met Arg Gin Ala Leu His Ala Arg Ala Glu Gin Leu Pro Leu Ser 
145 150 155 160 

Gly Gly Asn Tyr Leu Glu Phe Ala Glu Asp Thr Glu Asp Ala Leu Val 

165 170 175 

Ser Gly Cys Asn Gly Ala Ala Val Ala Leu lie Glu Arg Ser Leu Tyr 

180 185 190 

Glu Ala His Gin Arg Leu Asp Gin Ser Val Arg Leu Leu Leu His Gly 
195 200 " 205 

Gly Gly Val Ala Ser Leu Leu Pro Trp Leu Gly Asp Val Val His Arg 
210 215 220 

Pro Thr Leu Val Leu Asp Gly Leu Ala lie Trp Ala Ala Val Ala Ala 
225 230 235 240 

Asn Val 



<210> 66 
<211> 729 
<212> DNA 

<213> Xylella fastidiosa 
<400> 66 

atgaatgatt ggttattcga tctaggtaat tcgcgtttta aatgtgcatc gctcagggaa 60 
ggtgtgattg gtcctgtaac ggttttgccg tacttaacag agaccatgga cgcgtttgcg 120 
ttacaggagc taccacgtgg tcgtgtggct tacttggcga gtgtcgctgc tccggctatt 180 
actacacatg tgctcgaagt attaaaaatc cacttcgagc aagtccaggt ggctgcaacc 240 
gtcgctgcat gtgccggagt acgaattgcc tatgctcacc cggaacgttt tggagtggat 300 
aggttcttag cgttgcttgg ttcgtatggt gagggcaatg tcctggtagt gggtgtcggg 360 
acagcattga ctattgattt gttggctgcc aatggttgtc atctcggagg gcgtatcagt 420 
gcttcaccga cattgatgcg ccaagcgttg catgcacgcg cggagcaact ccccctcagt 480 
ggtgggaact acttggagtt tgcggaagat acagaggatg cgttggtgtc agggtgcaat 540 
ggtgcagcgg tggcattgat cgaacgtagc ctgtatgagg cacatcaacg tttggaccag 600 
tcggttcgat tattgttgca tggtggaggt gtagcatctt tattgccttg gttgggcgac 660 
gtggtacatc gtcctacatt agtattggat ggcctggcga tctgggctgc cgttgcagct 720 
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aacgtttag 729 

<210> 67 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 
<400> 67 

Met Pro Ala Arg Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys 
15 10 15 

Asp lie Gly Asn Thr Arg lie His Phe Ala Gin Asn Tyr Gin Leu Phe 

20 25 30 

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly lie Gin Lys Glu lie 
35 40 45 

Phe Tyr lie Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys 
50 55 60 

Tyr Pro Asn Ala Lys Asn lie Ala Gly Phe Phe His Leu Glu Thr Asp 
65 70 75 80 

Tyr Val Gly Leu Gly lie Asp Arg Gin Met Ala Cys Leu Ala Val Asn 

85 90 95 

Asn Gly Val Val Val Asp Ala Gly Ser Ala lie Thr lie Asp Leu lie 

100 105 110 

Lys Glu Gly Lys His Leu Gly Gly Cys lie Leu Pro Gly Leu Ala Gin 
115 120 125 

Tyr lie His Ala Tyr Lys Lys Ser Ala Lys lie Leu Glu Gin Pro Phe 
130 135 140 

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala- 
145 150 155 160 

Val Asn Tyr Gly Met Val Leu Ser Val lie Ala Cys lie Gin His Leu 

165 170 175 

Ala Lys Asn Gin Lys lie Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu 

180 185 190 

Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp 
195 200 205 

Gly Met Glu lie Ala Leu Lys Lys Ala Gly lie Leu Glu Cys Lys 
210 215 220 



<210> 68 
<211> 672 
<212> DNA 

<213> Helicobacter pylori 
<400> 68 

atgccagcta ggcaatcttt tacagatttg 
acgcgtatcc attttgcaca aaactatcag 
cgtttgggta ttcaaaagga aattttttac 
cttttgaatt gttaccctaa cgctaaaaat 



aaaaacctgg ttttgtgcga tataggcaac 60 
ctcttttcaa gcgctaaaga agatttaaag 120 
attagcgtga atgaagaaaa tgaaaaagcc 180 
attgcagggt tttttcattt agaaaccgac 240 
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tatgtagggc ttgggataga ccggcaaatg gcgtgtctgg cggtaaataa tggcgtggtg 300 

gtggatgccg ggagtgcgat tacgatagat ttaatcaaag agggcaagca tttaggaggg 360 

tgtattttac ccggtttagc ccaatatatt catgcgtata aaaaaagcgc taaaatttta 420 

gagcaacctt tcaaggcctt agattcttta gaagttttac ctaaaagcac tagagacgct 480 

gtgaattacg gcatggtttt gagcgtcatt gcttgtatcc agcatttagc caaaaatcaa 540 

aaaatctatc tttgtggggg cgatgcgaag tatttgagcg cgtttttacc ccattctgtt 600 

tgcaaggagc gtttggtttt tgacgggatg gaaatcgctc ttaaaaaagc agggatacta 660 

gaatgcaaat ga 672 

<210> 69 
<211> 8320 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid, pAN296 



60 



<400> 69 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 

180 
240 
300 
360 



ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 
gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 
catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 
gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540 
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600 
ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 
gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720 
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 
cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840 
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900 
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 
ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020 
gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080 
acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140 
ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200 
aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260 
gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 
cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380 
ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 1440 
tttgcacctt acattgccga ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500 
tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560 
tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620 
gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1680 
atgcttcagg aaaccgatga ttaccgcatc ttacaggaag agcggtttac cgggtttttc 1740 
tcccgctatt cgattgctgt cggttcgaca ggaaatctag gtttaagcat cggcatcatc 1800 
ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860 
aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920 
gcggtgaacg aagggagacg gcaggcggaa caagatccat tctgttattt tattgatgat 1980 
gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040 
cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100 
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata cggagatgat 2160 
gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttattca 2220 
ggacttcacg agaagatctc cgtccaggat atcggcctgg ataatcagac ggctgctgac 2280 
ggacttgccg tagggaggcc gtcaggattt gtcggcaagc tgattgaacc gcttctgagc 2340 
ggctgttata cggtagagga caatacgctt tatactttgc ttcatatgct ggctgtatct 2400 
gaagataaat atttagagcc ctctgctctt gctggcatgt tcgggccggt tcagcttttt 24 60 
tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc cgtacatgtc 2520 
gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580 
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ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 2640 
tacaatcttt gtaagggttt taacctacag agagtcaggt gtaaacagtg aaaaataaag 2700 
aacttaacct acatacttta tatacacagc acaatcggga gtcttctgca gctcgagcaa 2760 
tagttaccct tattatcaag ataagaaaga aaaggatttt tcgctacgct caaatccttt 2820 
aaaaaaacac aaaagaccac attttttaat gtggtcttta ttcttcaact aaagcaccca 2880 
ttagttcaac aaacgaaaat tggataaagt gggatatttt taaaatatat atttatgtta 2940 
cagtaatatt gacttttaaa aaaggattga ttctaatgaa gaaagcagac aagtaagcct 3000 
cctaaattca ctttagataa aaatttagga ggcatatcaa atgaacttta ataaaattga 3060 
tttagacaat tggaagagaa aagagatatt taatcattat ttgaaccaac aaacgacttt 3120 
tagtataacc acagaaattg atattagtgt tttataccga aacataaaac aagaaggata 3180 
taaattttac cctgcattta ttttcttagt gacaagggtg ataaactcaa atacagcttt 3240 
tagaactggt tacaatagcg acggagagtt aggttattgg gataagttag agccacttta 3300 
tacaattttt gatggtgtat ctaaaacatt ctctggtatt tggactcctg taaagaatga 3360 
cttcaaagag ttttatgatt tatacctttc tgatgtagag aaatataatg gttcggggaa 3420 
attgtttccc aaaacaccta tacctgaaaa tgctttttct ctttctatta ttccatggac 3480 
ttcatttact gggtttaact taaatatcaa taataatagt aattaccttc tacccattat 3540 
tacagcagga aaattcatta ataaaggtaa ttcaatatat ttaccgctat ctttacaggt 3600 
acatcattct gtttgtgatg gttatcatgc aggattgttt atgaactcta ttcaggaatt 3660 
gtcagatagg cctaatgact ggcttttata atatgagata atgccgactg tactttttac 3720 
agtcggtttt ctaatgtcac taacctgccc cgttagttga agaaggtttt tatattacag 3780 
ctgtcgacta aggtcgagga agtgttggta aggagggtat gaaatgtgca tcatattgaa 3840 
ctgtatgtct ctgatttgga ggcgtctagg cggttttggg gctggttctt aaaagaactt 3900 
ggttataaag agtatcaaaa atggagctca ggcatcagct ggaagaaaga tcgtttttac 3960 
ctagtgattg tgcaggcgaa agagccattt ctagagccgg aataccatag atgccgagtc 4020 
ggtctgaacc atctcgcatt tcatgctgaa tccaagcttc aagtcgatca gatgactgaa 4080 
aaattgacgg caaaaggcta tcgtgtgttg taccgagaca ggcatccttt tgccggagga . 4 14 0 
gacgggcatt atgcagtctt ttgtgaggat ccagaccgga ttaaggtaga gctcgttgcc 4200 
ccaagctgtt aatcgtgatc ttcggacagg ctgttcagct ttttctcaat gcgatccagc 4260 
tgcgcttttc ggtttttcgc atacttgaag cctgtaacag ccgcaaagac gacagcggca 4320 
aatataataa atacaaacag ctgaaacatc acatcaccta tattcatgtt cttcacctca 4380 
tgtttgcggg agagattcat tctcttccgt tttttattta aagcggcttt tccagacggg 4440 
aacggtgttt tgtggtctcc attttcattt gccgataggc gaacgctaaa aatggcaggc 4500 
cgaacagggt aatgccgctc aggacagaaa aaatataaat cggccggcca gcgccaaaca 4560 
ggtctataca tatccccccg acccaagggc cgatgacgtt tccgagctgt ggaaaaccga 4 620 
ttgccccgaa ataagtgcct tttaatcctg gttttgcaat ctggtctaca tacaaatcca 4680 
rcatagagaa taaaagcact tcgccgattg taaatgtgat gacaatcatc acaattgatg 4740 
gaacaccgtg tgatacggtg aaaatggcca tgctgatgct aaccatcaca ttaccgagca 4800 
tcagagaaca aagcggcgaa aaccgttttg caaaatggac aatgggaaat tgcgtcgcca 4860. 
acacaacgat tgcgtttaat gtcagcatca gcccatacag cttcgttcca ttgccgatca 4920 
aggggttctg cgccatatac tgagggaatg tggaactgaa ttgtgagtag ccgaaggtgc 4 980 
atagcgtaat gccgaccaaa gcaatggtaa aaagataatc cttttgcgtg accataaacg 5040 
cttcccgcac gctcatattt cgggactggg ctggtgctga taaggatgga tgttttttaa 5100 
attggagggc aagcacaatt ccgtatagtc cgtaaatgac tgcaggcacc aaaaagggcg 5160 
tagtcgattg cgatgagccg aaatataggc caagcacagg tccgaagaca acgccgatat 5220 
taatagccgc atagcgtaaa ttaaaaacta gcagtctcgt tttttcttct gtcatatcag 5280 
acaacaaggc ctttgaagcg ggctcaaaca gtgatttgca aagaccgttt aatgcgttta 5340 
ctacaaaaaa cacccagaga ttagatgctg ccgcaaagcc tgcaaatacc agcatccatc 5400 
cgaaaatcga tacaagcatc atgttttttc tgccgaattt atctgagata tatccgccgt 54 60 
aaaagcttgc gaggatgccg actgatgagc tcgcggcgat gaccagccct gcataggaag 5520 
ctgatgcgcc ttggacggct gtcaaataaa tcgctaaaaa aggaatgctc atcgatgttg 5580 
ccattctgcc gaaaatggtt ccgattataa ttgtaacgcg ttggatgcat agcttgagta 5640 
ttctatagtg tcacctaaat agcttggcgt aatcatggtc atagctgttt cctgtgtgaa 5700 
attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 5760 
ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc 5820 
agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 5880 
gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 5940 
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 6000 
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 6060 
aggccgcgtt gctggcgttt ttcgataggc tccgcccccc tgacgagcat cacaaaaatc 6120 
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 6180 
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 6240 



OGZ-001 



-50- 



cctttctccc ttcgggaagc gtggcgcttt 
cggtgtaggt cgttcgctcc aagctgggct 
gctgcgcctt atccggtaac tatcgtcttg 
cactggcagc agccactggt aacaggatta 
agttcttgaa gtggtggcct aactacggct 
ctctgctgaa gccagttacc ttcggaaaaa 
ccaccgctgg tagcggtggt ttttttgttt 
gatctcaaga agatcctttg atcttttcta 
cacgttaagg gattttggtc atgagattat 
attaaaaatg aagttttaaa tcaatctaaa 
accaatgctt aatcagtgag gcacctatct 
ttgcctgact ccccgtcgtg tagataacta 
gtgctgcaat gataccgcga gacccacgct 
agccagccgg aagggccgag cgcagaagtg 
ctattaattg ttgccgggaa gctagagtaa 
ttgttggcat tgctacaggc atcgtggtgt 
gctccggttc ccaacgatca aggcgagtta 
ttagctcctt cggtcctccg atcgttgtca 
tggttatggc agcactgcat aattctctta 
tgactggtga gtactcaacc aagtcattct 
cttgcccggc gtcaatacgg gataatagtg 
tcattggaaa acgttcttcg gggcgaaaac 
gttcgatgta acccactcgt gcacccaact 
tttctgggtg agcaaaaaca ggaaggcaaa 
ggaaatgttg aatactcata ctcttccttt 
attgtctcat gagcggatac atatttgaat 
cgcgcacatt tccccgaaaa gtgccacctg 
aaggagaaaa taccgcatca ggcgaaattg 
taaatatttg ttaaatcagc tcatttttta 
ataaatcaaa agaatagacc gagatagggt 
cactattaaa gaacgtggac tccaacgtca 
gcccactacg tgaaccatca cccaaatcaa 
taaatcggaa ccctaaaggg agcccccgat 
tggcgagaaa ggaagggaag aaagcgaaag 
cggtcacgct gcgcgtaacc accacacccg 

<210> 70 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 



ctcatagctc acgctgtagg tatctcagtt 6300 
gtgtgcacga accccccgtt cagcccgacc 6360 
agtccaaccc ggtaagacac gacttatcgc 6420 
gcagagcgag gtatgtaggc ggtgctacag 6480 
acactagaag gacagtattt ggtatctgcg 6540 
gagttggtag ctcttgatcc ggcaaacaaa 6600 
gcaagcagca gattacgcgc agaaaaaaag 6660 
cggggtctga cgctcagtgg aacgaaaact 6720 
caaaaaggat cttcacctag atccttttaa 6780 
gtatatatga gtaaacttgg tctgacagtt 6840 
cagcgatctg tctatttcgt tcatccatag 6900 
cgatacggga gggcttacca tctggcccca 6960 
caccggctcc agatttatca gcaataaacc 7020 
gtcctgcaac tttatccgcc tccatccagt 7080 
gtagttcgcc agttaatagt ttgcgcaacg 7140 
cacgctcgtc gtttggtatg gcttcattca 7200 
catgatcccc catgttgtgc aaaaaagcgg 7260 
gaagtaagtt ggccgcagtg ttatcactca 7320 
ctgtcatgcc atccgtaaga tgcttttctg 7380 
gagaataccg cgcccggcga ccgagttgct 7440 
tatgacatag cagaacttta aaagtgctca 7500 
tctcaaggat cttaccgctg ttgagatcca 7560 
gatcttcagc atcttttact ttcaccagcg 7620 
atgccgcaaa aaagggaata agggcgacac 7680 
ttcaatatta ttgaagcatt tatcagggtt 7740 
gtatttagaa aaataaacaa ataggggttc 7800 
tatgcggtgt gaaataccgc acagatgcgt 7860 
taaacgttaa tattttgtta aaattcgcgt 7920 
accaataggc cgaaatcggc aaaatccctt 7980 
tgagtgttgt tccagtttgg aacaagagtc 8040 
aagggcgaaa aaccgtctat cagggcgatg 8100 
gttttttgcg gtcgaggtgc cgtaaagctc 8160 
ttagagcttg acggggaaag ccggcgaacg 8220 
gagcgggcgc tagggcgctg gcaagtgtag 8280 
ccgcgcttaa * 8320 



<220> 

<223> Description of Artificial Sequence: plasmid, pAN336 



<400> 70 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgcacc aggcttctca 240 
ggcgctgact tagaaaacct cttgaatgaa gctgcgcttg tagcggctcg tcaaaacaag 300 
aaaaaaatcg atgcgcgtga tattgacgaa gcgacggacc gtgtaattgc cggacccgct 360 
aagaagagcc gcgttatctc caagaaagaa cgcaatatcg tggcttatca cgaaggcgga 4 20 
cacaccgtta tcggtctcgt tttagatgag gcagatatgg ttcataaagt aacgattgtt 480 
cctcggggcc aggctggcgg ttatgctgtt atgctgccaa gagaagaccg ttatttccaa 540 
acaaagccgg agctgcttga taaaattgtc ggcctcttgg gcggacgtgt tgctgaagag 600 
attatcttcg gtgaagtcag cacaggggcg cacaatgact tccagcgtgc gacgaatatt 660 
gcaagacgaa tggttacaga attcggtatg tcagaaaaac tgggaccgtt gcaatttgga 720 
cagtctcagg gcggtcaggt attcttaggc cgtgatttca acaacgaaca gaactacagt 780 
gatcaaatcg cttacgaaat tgatcaggaa attcagcgca tcatcaaaga atgttatgag 840 
cgtgcgaaac aaatcctgac tgaaaatcgt gacaagcttg aattgattgc ccaaacgctt 900 
ctgaaagttg aaacgcttga cgctgaacaa atcaaacacc ttatcgatca tggaacatta 960 
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cctgagcgta atttctcaga tgatgaaaag 
aaaacagaag aaaagaaaga cgatacgaaa 
ccggctgacg ctggcagttt ttttatgtaa 
atccaattct ggtatcgatt tgtttacaaa 
atgtttgtaa aaagtaaagt agagaaacta 
ttatcgatgt ggggaacacc ctgcagctcg 
aaataccgca tcaggcgata aacccagcga 
gaggtatgaa aacgagaatt ggacctttac 
aagctaccaa gacgaagagg atgaagagga 
caatactgat aagataatat atcttttata 
gggggcaagg cataggcagc gcgcttatca 
acttgcatgg actaatgctt gaaacccagg 
ataattgtgg tttcaaaatc ggctccgtcg 
actttgaaaa agctgttttc tggtatttaa 
agttcgtctt gttataatta gcttcttggg 
gaaataataa atggctaaaa tgagaatatc 
ccgctgcgta aaagatacgg aaggaatgtc 
aaatgaaaac ctatatttaa aaatgacgga 
ggaacgggaa aaggacatga tgctatggct 
gcactttgaa cggcatgatg gctggagcaa 
ttgctcggaa gagtatgaag atgaacaaag 
gtgcatcagg ctctttcact ccatcgacat 
cagccgctta gccgaattgg attacttact 
aaactgggaa gaagacactc catttaaaga 
ggaaaagccc gaagaggaac ttgtcttttc 
tgtgaaagat ggcaaagtaa gtggctttat 
gtggtatgac attgccttct gcgtccggtc 
tgtcgagcta ttttttgact tactggggat 
tattttactg gatgaattgt tttagtacct 
aagcttttta gacatctaat cttttctgaa 
tttatatctt ttctaaaagt tcgctagata 
tcgccattcg ccattcaggc tgcgcaactg 
caggcaaaac aggacccaag gtcattgcga 
aatcagattg tatagacatc gttgatccat 
atgaaagaaa ccgcgtagga agtgtatagg 
acttgcgtat gacggaaaag ttcgggctta 
ggggcagaga cgccatggta cgtggccgac 
agcttcactt atgctcggcg ctatgctgaa 
gggcggaggt ccgatcggag ctattgtagc 
ctatgtctct aacccgcaag ttcattttga 
acgtgcggtt ggaacaaacg gaacgttaag 
cttcacagga caagtagaaa tcgtttcagg 
tgtgtcatct gagcaggttc cttcatcagt 
taccattctt gcggcagggg gctttattat 
aatcacaaaa attgaacagc gtctatctca 
agggctgaca ccagaagaaa ttttagaaga 
aacgatgcct gtcagattcc attgcccttg 
aggactaggc aaaaaagaaa ttcaagatat 
atgccatttt tgtaatgaaa agtacttatt 
ccaaactacc cgctaagctc tttagcgggt 
gtttgaaatc aagaacaatc tggacgcgtt 
acctaaatag cttggcgtaa tcatggtcat 
tcacaattcc acacaacata cgagccggaa 
gagtgagcta actcacatta attgcgttgc 
tgtcgtgcca gctgcattaa tgaatcggcc 
ggcgctcttc cgcttcctcg ctcactgact 
cggtatcagc tcactcaaag gcggtaatac 
gaaagaacat gtgagcaaaa ggccagcaaa 
tggcgttttt cgataggctc cgcccccctg 
agaggtggcg aaacccgaca ggactataaa 
tcgtgcgctc tcctgttccg accctgccgc 



aacgatgatg tgaaagtaaa cattctgaca 1020 
gagtaattcg ctttctttct aaaaaaactg 1080 
atgattggct cagctgcggc ttttacaatc 1140 
tgagccgctg atcgtgtatg gtattgtaga 1200 
ttcaaaagtg gtgatagagg ttgttactgg 1260 
agtgaaatac cgcacagatg cgtaaggaga 1320 
accatttgag gtgataggta agattatacc 1380 
agaattactc tatgaagcgc catatttaaa 1440 
tgaggaggca gattgccttg aatatattga 1500 
tagaagatat cgccgtatgt aaggatttca 1560 
atatatctat agaatgggca aagcataaaa 1620 
acaataacct tatagcttgt aaattctatc 1680 
atactatgtt atacgccaac tttcaaaaca 1740 
ggttttagaa tgcaaggaac agtgaattgg 1800 
gtatctttaa atactgtaga aaagaggaag 1860 
accggaattg aaaaaactga tcgaaaaata 1920 
tcctgctaag gtatataagc tggtgggaga 1980 
cagccggtat aaagggacca cctatgatgt 2040 
ggaaggaaag ctgcctgttc caaaggtcct 2100 
tctgctcatg agtgaggccg atggcgtcct 2160 
ccctgaaaag attatcgagc tgtatgcgga 2220 
atcggattgt ccctatacga atagcttaga 2280 
gaataacgat ctggccgatg tggattgcga 2340 
tccgcgcgag ctgtatgatt ttttaaagac 2400 
ccacggcgac ctgggagaca gcaacatctt 24 60 
tgatcttggg agaagcggca gggcggacaa 2520 
gatcagggag gatatcgggg aagaacagta 2580 
caagcctgat tgggagaaaa taaaatatta 2640 
agatttagat gtctaaaaag ctttaactac 2700 
gtacatccgc aactgtccat actctgatgr 2760 
ggggtcccga gcgcctacga ggaatttgta 2820 
ttgggaaggg cgatcggtgc ggtcgactgg 2880 
caggaggcct ggcgccgctc attgcgaacg 2940 
tcttaaccct aaaagggctg gaattgattt 3000 
aggtttagta atggattatt tagtaaaagc 3060 
tgcagcgaga acgactgata tggtaaatga 3120 
agcatccgct gcactaggcc gtacaatgac 3180 
gggcgatgat aagctgaccg tgaaaatcga 3240 
tgatgccaat gccaaaggag aagtcagagc 3300 
tttaaatgaa caaggtaagc ttgatgtcag 3360 
tgtcgtaaaa gatttaggtt tgcgcgagtt 3420 
agaattagga gatgatttta cttactatct 3480 
gggcgtaggt gtgctcgtaa atcctgacaa 3540 
tcagctgatg ccgggaacag atgatgaaac 3600 
agtagagccg atttctaagc tcatccaaaa 3660 
agtcctaggc gagaaacctg agattttgga 3720 
ttcaaaagaa cggttcgaaa cagccatttt 3780 
gatagaagaa gatggacaag ccgaagcagt 3840 
tacaaaagaa gagctggaag ggcttcgtga 3900 
ttttaatttg agaaaagggg ctgaaagcag 3960 
ggatgcatag cttgagtatt ctatagtgtc 4020 
agctgtttcc tgtgtgaaat tgttatccgc 4080 
gcataaagtg taaagcctgg ggtgcctaat 4140 
gctcactgcc cgctttccag tcgggaaacc 4200 
aacgcgcggg gagaggcggt ttgcgtattg 42 60 
cgctgcgctc ggtcgttcgg ctgcggcgag 4320 
ggttatccac agaatcaggg gataacgcag 4380 
aggccaggaa ccgtaaaaag gccgcgttgc 4 4 40 
acgagcatca caaaaatcga cgctcaagtc 4500 
gataccaggc gtttccccct ggaagctccc 4560 
ttaccggata cctgtccgcc tttctccctt 4 620 
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cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 4 680 
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 4740 
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 4800 
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 4860 
ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc 4920 
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 4980 
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 5040 
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 5100 
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 5160 
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 5220 
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 5280 
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 5340 
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 5400 
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 54 60 
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttggcattg 5520 
ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 5580 
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 5640 
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 5700 
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 5760 
actcaaccaa gtcattctga gaataccgcg cccggcgacc gagttgctct tgcccggcgt 5820 
caatacggga taatagtgta tgacatagca gaactttaaa agtgctcatc attggaaaac 5880 
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 5940 
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 6000 
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 6060 
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 6120 
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 6180 
cccgaaaagt gccacctgta tgcggtgtga aataccgcac agatgcgtaa ggagaaaata 6240 
ccgcatcagg cgaaattgta aacgttaata ttttgttaaa attcgcgtta aatatttgtt 6300 
aaatcagctc attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag 6360 
aatagaccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 6420 
acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg 64 80 
aaccatcacc caaatcaagt tttttgcggt cgaggtgccg taaagctcta aatcggaace 6540 
ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg 6600 
aagggaagaa agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc 6660 
gcgtaaccac cacacccgcc gcgcttaa 6688 

<210> 71 
<211> 9396 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid, pAN341 and pAN342 
<400> 71 

ttgcggccgc ttcgaactgt tataaaaaaa ggatcaattt tgaactctct cccaaagttg 60 
atcccttaac gatttagaaa tccctttgag aatgtttata tacattcaag gtaaccagcc 120 
aactaatgac aatgattcct gaaaaaagta ataacaaatt actatacaga taagttgact 180 
gatcaacttc cataggtaac aacctttgat caagtaaggg tatggataat aaaccaccta 240 
caattgcaat acctgttccc tctgataaaa agctggtaaa gttaagcaaa ctcattccag 300 
caccagcttc ctgctgtttc aagctacttg aaacaattgt tgatataact gttttggtga 360 
acgaaagccc acctaaaaca aatacgatta taattgtcat gaaccatgat gttgtttcta 420 
aaagaaagga agcagttaaa aagctaacag aaagaaatgt aactccgatg tttaacacgt 480 
ataaaggacc tcttctatca acaagtatcc caccaatgta gccgaaaata atgacactca 540 
ttgttccagg gaaaataatt acacttccga tttcggcagt acttagctgg tgaacatctt 600 
tcatcatata aggaaccata gagacaaacc ctgctactgt tccaaatata attcccccac 660 
aaagaactcc aatcataaaa ggtatatttt tccctaatcc gggatcaaca aaaggatctg 720 
ttactttcct gatatgtttt acaaatatca ggaatgacag cacgctaacg ataagaaaag 780 
aaatgctata tgatgttgta aacaacataa aaaatacaat gcctacagac attagtataa 840 
ttcctttgat atcaaaatga ccttttatcc ttacttcttt ctttaataat ttcataagaa 900 
acggaacagt gataattgtt atcataggaa tgagtagaag ataggaccaa tgaatataat 960 
gggctatcat tccaccaatc gctggaccga ctccttctcc catggctact atcgatccaa 1020 
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taagaccaaa tgctttaccc ctattttcct 
cgagtgctgg aaatgcagct gcaccagccc 
aaaagaaaga atggccaaca aacccaatta 
ataggagtaa ccttttgatg cctaattgat 
tggaaaaggt taacataaag gctgtgttca 
aatcatttgc aatatcaggt aatgagacgt 
aagataaaat gcaaagccaa attaaaattt 
atgtattcac atttcaccct ccaataatga 
cgcttccctc ttttaattga accctgttac 
aaacttgatt aaaacatttt accacatata 
ttatacaaca atatggcccg tttgttgaac 
caattccaca ttgcaataat agaaaatcca 
tgaatcaaat cgccttcttc tgtgtcatca 
aaaccaccat aggagattaa ccttttacgg 
ttcaaattct tttcttcatc atcggtcata 
gtttcgtcaa ttgccgattg tatatccgat 
acttttacat ttggatcata gtctaatttc 
ttttgattca cgtagttttc tgtattctta 
tgcatgtgct gattataaga attatcttta 
aaaccaacaa gatttttatt aattttttta 
atatctgaca aactcttatt taattcttcg 
agaaacaacc aacgaactgt tggcttttgt 
tgaatgccat gtttcattgc tctcctccag 
aaccacactc gatacaactt tctttcgcct 
tcagcacaat cttttactct ttcagccttt 
taatcaacat tagcgatttt cttttctctc 
gttgcaaacg attctggata gtttgccgga 
aagagatccg tattccccgc tgaaaactca 
accatgtctc tgtaccattc atcaagtcca 
tcaggagcca tttccgggtg gggatttcct 
ccagcttcca aaatgtcagc atcagtgccg 
cattttgcgc caggtgctac accgatttga 
gtgcctgtca cgtgggttcc atgagccaaa 
gcatcatacc agttcatttc attttcaggc 
tatttctctt ttaatgccgg atgattccat 
gtgccagttc catcatatcc aagtgcccaa 
cattccacac cgtcagttgc tttaatagct 
gatgacttaa aaagctgccg tttctcatta 
accacttttt ccataacctc ttttgaggca 
gaatgaattt ggtctgcatt tcctttatct 
tgctgggatt catcggctgt gacttttaaa 
tccgtcttag cggcagacag cttcttcgat 
ttttctgggt acccctatgt attactagaa 
ttcaatggag gtgatgtccg tttttcatta 
ctcttggacc cgggatccgc aagtgctttt 
cacttcctac gcggtttctt tcataaatca 
caacgatgtc tatacaatct gattcgttcg 
tgaccttggg tcctgttttg cctgccattt 
gacatagcca aataaaattc cagattgcat 
gtcgggccgg gtgatttcga tacgaggaag 
cgaaattgta atcccagggg caatcgcccc 
gcagtacgtt gtggcggttc cgaaatcgac 
cgcagcgaca gcatttacga ttctgtctgc 
atttaaaccg gttttcatac ctggaccaac 
gcacattctt tctaacgcaa acatgattgg 
tatctgttca aacataagcc cggagtgatc 
atcttctgtt ttatgcctgc ttgtttctat 
atatacacca agtacagtat tggtgttccc 
ccacttttga atagtttctc tagaacaggc 
acacaccctg tagggaaagt caataccttt 
tctagggaca cctctttgta cccctcaagg 



ttggaatata gcgcgcaact acaaccatta 1080 
cttgaataaa acgagccata ataagtaagg 1140 
ccgacccgaa acaatttatt ataattccaa 1200 
cagatagctt tccatataca gctgttccaa 1260 
cccagtttgt actcgcaggt ggtttattaa 1320 
tcaaaaccat ttcatttaat acgctaaaaa 1380 
ggttgtgtcg taaattcgat tgtgaatagg 1440 
gggcagacgt agtttatagg gttaatgata 1500 
attcattaca cttcataatt aattcctcct 1560 
aactaagttt taaattcagt atttcatcac 1620 
tactctttaa taaaataatt tttccgttcc 1680 
tcttcatcgg ctttttcgtc atcatctgta 1740 
aggtttaatt ttttatgtat ttcttttaac 1800 
tgtaaacctt cctccaaatc agacaaacgt 1860 
aaatccgtat cctttacagg atattttgca 1920 
ttatatttat ttttcggtcg aatcatttga 1980 
a-ttgcctttt tccaaaattg aatccattgt 2040 
aaataagttg gttccacaca taccaataca 2100 
ttatttattg tcacttccgt tgcacgcata 2160 
tattgcatca ttcggcgaaa tccttgagcc 2220 
ccatcataaa catttttaac tgttaatgtg 2280 
ttaataactt cagcaacaac cttttgtgac 2340 
ttgcacattg gacaaagcct ggatttacaa 2400 
gtttcacgat tttgtttata ctctaatatt 2460 
ttaaattcaa gaatatgcag aagttcaaag 2520 
catggggaat tggaattctc agtcgctcca 2580 
tttgcgatag aaccaggccc gccgggaata 2640 
gggaaaatat cggccgaacg ccaggcattg 2700 
gagccccctc cccatgagtt attgacaaca 2760 
tccgcgtcct ttggtgctaa aacccattca 2820 
ccatcttcag agaacgcttt aacagcaatc 2880 
tttgttccat caggttcaga gcccaccatc 2940 
tcatcataag ggcttgcctc gcctgctacg 3000 
tcattaggat tttccggatt atatccgcga 3060. 
tccaccccgg tatcaatgga cgcaacaacc 31.20 
gcttttgggg catcgatttg gtctacattc 3180 
ttctgtgctt ttttcatatt aaarggggag 3240 
ggaagcacct tttccacttc gggaaactgc 3300 
tgaacagcaa tcccgttcac cacataataa 3360 
ttctgggtgt tcaagtattt taggacatct 3420 
gatgacacaa cagcagaacg cttttgatat 3480 
ttcgcttttt taacagccgc ttttgccgct 3540 
aataacatag taaaacggac atcactccgt 3600 
caacaaatta cttatctatt tgtaatgctg 3660 
actaaataat ccattactaa acctcctata 3720 
attccagccc ttttagggtt aagaatggat 3780 
caatgagcgg cgccaggcct cctgtcgcaa 3840 
cattcgctta acgattcctt ccacttggcc 3900 
cgcgctaaca gtgttttttc cgataatatt 3960 
ctttgctgca cgcgagtaaa gcgcctctgt 4020 
gcccatgtat tgtttgtttt catcaatata 4080 
aacaattaat ggattgccgt acaagtgtat 4140 
ccctacttct ttcggattgt catattttat 4200 
aatttgaggc tcgatatgaa agtattttgt 4260 
cggcactact gacgaaataa taatgccatc 4320 
aaataaggag cgcaaaatca tcccaaactc 4380 
acgccagtga tattctaatt ttccatcatg 4440 
cacatcgata accagtaaca acctctatca 4500 
ggggttgccc ccgcctgtaa ttaaattatt 4560 
ttgtaaaatt tttacacagc gtggatctct 4 620 
gagaaatatt ggcggtactg agcacagttt 4 680 
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tggttggtgg acagtgaacc atagctgtcg 
aaaggaaaca aaaagaccgt tttcacacaa 
acagccactt ttgcaaaaac cggacagctt 
agcttcgcga agcggccgca aaattcactg 
aaccctggcg ttacccaact taatcgcctt 
aatagcgaag aggcccgcac cgatcgccct 
tggcgcctga tgcggtattt tctccttacg 
tgcactctca gtacaatctg ctctgatgcc 
acacccgctg actatgcttg taaaccgttt 
acctctaggg tccccaatta attagtaata 
ccaccggatc agcttagtaa agccctcgct 
gccaactatt gcgataacaa gaaaaagcca 
agggcttatt atgcacgctt aaaaataata 
gagcaattat gtgcttagtg catctaacgc 
ggcttgaacg aattgttaga cattatttgc 
agtggacaaa ttcttccaac tgatctgcgc 
gataagcctg tctagcttca agtatgacgg 
cccagtcggc agcgacatcc ttcggcgcga 
gggacaacgt aagcactaca tttcgctcat 
gcgttaaggt ttcatttagc gcctcaaata 
cctccgccgc tggacctacc aaggcaacgc 
ccagatcaat gtcgatcgtg gctggctcga 
attctccaaa ttgcagttcg cgcttagctg 
caacaatggt gacttctaca gcgcggagaa 
ccaaaaggtc gttgatcaaa gctcgccgcg 
ccagcaaatc aatatcactg tgtggcttca 
gtacggccag caacgtcggt tcgagatggc 
gagtcgatac ttcggcgatc accgcttccc 
tgccctgctg cgtaacatcg ttgctgctcc 
gcgcttgctg cttggatgcc cgaggcatag 
ccatgaaaac cgccactgcg ccgttaccac 
tgcgtgagcg catacgctac ttgcattaca 
gggttcgtgc cttcatccgt ttccacggtg 
aagtcgaggc atttctgtcc tggctggcga 
gtcaggcatt ggcggccttg ctgttcttct 
ggcttcagga gatcggaaga cctcggccgt 
atgaagtggt tcgcatcctc ggttttctgg 
tgtatggaac gggcatgcgg atcagtgagg 
tcgatcacgg cacgatcatc gtgcgggagg 
tacccgagag cttggcaccc agcctgcgcg 
t'tgaatgacc tttaatagat tatattacta 
ttattttaaa aattttttca caaaacggtt 
acgggctgtt ctggtgttgc tagtttgtta 
cggctgaaag cgctatttct tccagaattg 
tggctcccgt gttgtcggca gctttgattc 
tatgtgtgac tgttgagctg taacaagttg 
ctttgtttta ctggtttcac ctgttctatt 
gtcgatctgt tcatggtgaa cagctttaaa 
atctatcttt tttacaccgt tttcatctgt 
acggtgaaca gttgttctac ttttgtttgt 
agccataaga acctcagatc cttccgtatt 
gtttttgcgt gagccatgag aacgaaccat 
aaaattttgc ctcaaaactg gtgagctgaa 
ttcttagtcc gttacgtagg taggaatctg 
tcatttttat ctggttgttc tcaagttcgg 
cttggaaaat caacgtatca gtcgggcggc 
tgtaagtgtt taaatcttta cttattggtt 
catggtagtt attttcaagc attaacatga 
ttgccttgtg agttttcttt tgtgttagtt 
agtatttgtt ttcaaaagac ttaacatgtt 
ggaaaagata aggcaatatc tcttcactaa 



tcaatagcct cgagttatgg cagttggtta 4740 
aacggtcttt ttcgatttct ttttacagtc 4800 
catgccttat aactgctgtt tcggtcgaca 4860 
gccgtcgttt tacaacgtcg tgactgggaa 4 920 
gcagcacatc cccctttcgc cagctggcgt 4 980 
tcccaacagt tgcgcagcct gaatggcgaa 5040 
catctgtgcg gtatttcaca ccgcatatgg 5100 
gcatagttaa gccagccccg acacccgcca 5160 
tgtgaaaaaa tttttaaaat aaaaaagggg 5220 
taatctatta aaggtcattc aaaaggtcat 5280 
agattttaat gcggatgttg cgattacttc 5340 
gcctttcatg atatatctcc caatttgtgt 5400 
aaagcagact tgacctgata gtttggctgt 54 60 
ttgagttaag ccgcgccgcg aagcggcgtc 5520 
cgactacctt ggtgatctcg cctttcacgt 5580 
gcgaggccaa gcgatcttct tcttgtccaa 5640 
gctgatactg ggccggcagg cgctccattg 5700 
ttttgccggt .tactgcgctg taccaaatgc 57 60 
cgccagccca gtcgggcggc gagttccata 5820 
gatcctgttc aggaaccgga tcaaagagtt 5880 
tatgttctct tgcttttgtc agcaagatag 5940 
agatacctgc aagaatgtca ttgcgctgcc 6000 
gataacgcca cggaatgatg tcgtcgtgca- 6060 
tctcgctctc tccaggggaa gccgaagttt 6120 
ttgtttcatc aagccttacg gtcaccgtaa 6180 
ggccgccatc cactgcggag ccgtacaaat 6240 
gctcgatgac gccaactacc tctgatagtt 6300 
tcatgatgtt taactttgtt ttagggcgac 6360 
ataacatcaa acatcgaccc acggcgtaac 6420 
actgtacccc aaaaaaacag tcataacaag 6480 
cgctgcgttc ggtcaaggtt ctggaccagt 6540 
gcttacgaac cgaacaggct tatgtccact 6600 
tgcgtcaccc ggcaaccttg ggcagcagcg 6660 
acgagcgcaa ggtttcggtc tccacgcatc 6720 
acggcaaggt gctgtgcacg gatctgccct 6780 
cgcggcgctt gccggtggtg ctgaccccgg 6840 
aaggcgagca tcgtttgttc gcccagcttc 6900 
gtttgcaact. gcgggtcaag gatctggatt 6960 
gcaagggctc caaggatcgg gccttgatgt 7020 
agcaggggaa ttgatccggt ggatgacctt 7080 
attaattggg gaccctagag gtcccctttt 7140 
tacaagcata acgggttttg ctgcccgcaa 7200 
tcagaatcgc agatccggct tcaggtttgc 7260 
ccatgatttt ttccccacgg gaggcgtcac 7320 
gataagcagc atcgcctgtt tcaggctgtc 7380 
tctcaggtgt tcaatttcat gttctagttg 7440 
aggtgttaca tgctgttcat ctgttacatt 7500 
tgcaccaaaa actcgtaaaa gctctgatgt 7560 
gcatatggac agttttccct ttgatatcta 7620 
tagtcttgat gcttcactga tagatacaag 7680 
tagccagtat gttctctagt gtggttcgtt 7740 
tgagatcatg cttactttgc atgtcactca 7800 
tttttgcagt taaagcatcg tgtagtgttt 7860 
atgtaatggt tgttggtatt ttgtcaccat 7920 
ttacgagatc catttgtcta tctagttcaa 7980 
ctcgcttatc aaccaccaat ttcatattgc 8040 
tcaaaaccca ttggttaagc cttttaaact 8100 
acttaaattc atcaaggcta atctctatat 8160 
cttttaataa ccactcataa atcctcatag 8220 
ccagattata ttttatgaat ttttttaact 8280 
aaactaattc taatttttcg cttgagaact 8340 
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tggcatagtt 
cagttctcgt 
tgatgttcat 
tagggttttc 
gctccgttaa 
acatacatct 
aatgataatt 
ttgctggaaa 
ttttttttgt 
taaaaagaat 
acaaaaggat 
gcttaagtag 
atcaggcacc 
gcagtgaatg 
cccataatac 
tgtggtgcta 
ccacttcgga 
ggtatcatca 



tgtccactgg 
catcagctct 
catctgagcg 
aatcgtgggg 
gtcatagcga 
caattggtct 
actagtcctt 
acttgtaaat 
ttatattcaa 
agatcccagc 
gtcgcaaacg 
caccctcgca 
tgagtcgctg 
ggggtaaatg 
aagaaaagcc 
tctgactttt 
ttatcccgtg 
acaggcttac 



aaaatctcaa 
ctggttgctt 
tattggttat 
ttgagtagtg 
ctaatcgcta 
aggtgatttt 
ttcctttgag 
tctgctagac 
gtggttataa 
cctgtgtata 
ctgtttgctc 
agctcgggca 
tctttttcgt 
gcactacagg 
cgtcacgggc 
tgctgttcag 
acaggtcatt 
ccgtcttact 



agcctttaac 
tagctaatac 
aagtgaacga 
ccacacagca 
gttcatttgc 
aatcactata 
ttgtgggtat 
cctctgtaaa 
tttatagaat 
actcactact 
ctctacaaaa 
aatcgctgaa 
gacattcagt 
cgccttttat 
ttctcagggc 
cagttcctgc 
cagactggct 
gtcaac 



caaaggattc 
accataagca 
taccgtccgt 
taaaattagc 
tttgaaaaca 
ccaattgaga 
ctgtaaattc 
ttccgctaga 
aaagaaagaa 
ttagtcagtt 
cagaccttaa 
tattcctttt 
tcgctgcgct 
ggattcatgc 
gtttt'atggc 
cctctgattt 
aatgcaccca 



ctgatttcca 8400 
ttttccctac 8460 
tctttccttg 8520 
ttggtttcat 8580 
actaattcag 8640 
tgggctagtc 8700 
tgctagacct 8760 
cctttgtgtg 8820 
taaaaaaaga 8880 
ccgcagtatt 8940 
aaccctaaag 9000 
gtctccgacc 9060 
cacggctctg 9120 
aaggaaacta 9180 
gggtctgcta 9240 
tccagtctga 9300 
gtaaggcagc 9360 

9396 



<210> 72 
<211> 9292 
<212> DNA 
<213> Artific 



al Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid, pAN329 and pAN330 



<400> 72 

t.tgcggccgc 

atcccttaac 

aactaatgac 

gatcaacttc 

caatt gcaat 

caccagctt c 

acgaaagccc 

aaagaaagga 

ataaaggacc 

ttgtt ccagg 

tcatcatata 

aaagaactcc 

ttactttcct 

aaatgctata 

ttcctttgat 

acggaacagt 

gggctatcat 

taagaccaaa 

cgagtgctgg 

aaaagaaaga 

ataggagtaa 

tggaaaaggt 

aatcatttgc 

aagataaaat 

atgtattcac 

cgcttccctc 

aaacttgatt 

ttatacaaca 

caattccaca 

tgaatcaaat 

aaaccaccat 

ttcaaattct 

gtttcgtcaa 



ttcgaactgt 
gatttagaaa 
aatgatt cct 
cataggt aac 
acctgtt ccc 
ctgctgtt t c 
acctaaaaca 
agcagttaaa 
tcttctatca 
gaaaataatt 
aggaaccata 
aat cataaaa 
gatatgtttt 
tgatgttgta 
atcaaaatga 
gataattgtt 
tccaccaatc 
tgctttaccc 
aaatgcagct 
atggccaaca 
ccttttgatg 
taacataaag 
aatatcaggt 
gcaaagccaa 
atttcaccct 
ttttaattga 
aaaacatttt 
atatggcccg 
ttgcaataat 
cgccttcttc 
aggagattaa 
tttcttcatc 
ttgccgattg 



tataaaaaaa 
tccctttgag 
gaaaaaagta 
aacct ttgat 
tctgataaaa 
aagctacttg 
aatacgatta 
aagctaacag 
acaagtatcc 
acacttccga 
gagacaaacc 
ggtatatttt 
acaaatatca 
aacaacataa 
ccttttatcc 
atcataggaa 
gctggaccga 
ctattttcct 
gcaccagccc 
aacccaatta 
cctaattgat 
gctgtgttca 
aatgagacgt 
attaaaattt 
ccaataatga 
accctgttac 
accacatata 
tttgttgaac 
agaaaatcca 
tgtgtcatca 
ccttttacgg 
atcggtcata 
tatatccgat 



ggat caattt 
aatgtttata 
ataacaaatt 
caagtaaggg 
agctggtaaa 
aaacaattgt 
taattgtcat 
aaagaaatgt 
caccaatgta 
t ttcggcagt 
ctgctact gt 
tccctaatcc 
ggaatgacag 
aaaatacaat 
ttacttcttt 
tgagtagaag 
ctccttctcc 
ttggaatata 
cttgaataaa 
ccgacccgaa 
cagatagctt 
cccagtttgt 
tcaaaaccat 
ggttgtgtcg 
gggcagacgt 
attcattaca 
aactaagttt 
tactctttaa 
tcttcatcgg 
aggtttaatt 
tgtaaacctt 
aaatccgtat 
ttatatttat 



tgaactctct 
tacattcaag 
actatacaga 
ta tggataat 
gttaagcaaa 
tgatataact 
gaaccatgat 
aact ccgatg 
gccgaaaata 
acttagctgg 
tccaaatata 
gggatcaaca 
cacgctaacg 
gcctacagac 
ctttaataat 
ataggaccaa 
catggctact 
gcgcgcaact 
acgagccata 
acaatttatt 
tccatataca 
actcgcaggt 
ttcatttaat 
taaattcgat 
agtttatagg 
cttcataatt 
taaattcagt 
taaaataatt 
ctttttcgtc 
ttttatgtat 
cctccaaatc 
cctttacagg 
ttttcggtcg 



cccaaagttg 60 
gtaaccagcc 120 - 
taagttgact 180 
aaaccaccta 24 0 
ctcattccag 300 
gttttggtga 360 
gttgtttcta 420 
tttaacacgt 480 
atgacactca 540 
tgaacatctt 600 
attcccccac 660. 
aaaggatctg 720 
ataagaaaag 780 
attagtataa 840 
ttcataagaa 900 
tgaatataat 960 
atcgatccaa 1020 
acaaccatta 1080 
ataagtaagg 1140 
ataattccaa 1200 
gctgttccaa 1260 
ggtttattaa 1320 
acgctaaaaa 1380 
tgtgaatagg 14 4 0 
gttaatgata 1500 
aattcctcct 1560 
atttcatcac 1620 
tttccgttcc 1680 
atcatctgta 1740 
ttcttttaac 1800 
agacaaacgt 1860 
atattttgca 1920 
aatcatttga 1980 



OGZ-001 



- 56- 



acttttacat ttggatcata gtctaatttc attgcctttt tccaaaattg aatccattgt 2040 
ttttgattca cgtagttttc tgtattctta aaataagttg gttccacaca taccaataca 2100 
tgcatgtgct gattataaga attatcttta ttatttattg tcacttccgt tgcacgcata 2160 
aaaccaacaa gatttttatt aattttttta tattgcatca ttcggcgaaa tccttgagcc 2220 
atatctgaca aactcttatt taattcttcg ccatcataaa catttttaac tgttaatgtg 2280 
agaaacaacc aacgaactgt tggcttttgt ttaataactt cagcaacaac cttttgtgac 2340 
tgaatgccat gtttcattgc tctcctccag ttgcacattg gacaaagcct ggatttacaa 2400 
aaccacactc gatacaactt tctttcgcct gtttcacgat tttgtttata ctctaatatt 24 60 
tcagcacaat cttttactct ttcagccttt ttaaattcaa gaatatgcag aagttcaaag 2520 
taatcaacat tagcgatttt cttttctctc catggggaat tggaattctc agtcgctcca 2580 
gttgcaaacg attctggata gtttgccgga tttgcgatag aaccaggccc gccgggaata 2640 
aagagatccg tattccccgc tgaaaactca gggaaaatat cggccgaacg ccaggcattg 2700 
accatgtctc tgtaccattc atcaagtcca gagccccctc cccatgagtt attgacaaca 27 60 
tcaggagcca tttccgggtg gggatttcct tccgcgtcct ttggtgctaa aacccattca 2820 
ccagcttcca aaatgtcagc atcagtgccg ccatcttcag agaacgcttt aacagcaatc 2880 
cattttgcgc caggtgctac accgatttga tttgttccat caggttcaga gcccaccatc 2940 
gtgcctgtca cgtgggttcc atgagccaaa tcatcataag ggcttgcctc gcctgctacg 3000 
gcatcatacc agttcatttc attttcaggc tcattaggat tttccggatt atatccgcga 3060 
tatttctctt ttaatgccgg atgattccat tccaccccgg tatcaatgga cgcaacaacc 3120 
gtgccagttc catcatatcc aagtgcccaa gcttttgggg catcgatttg gtctacattc 3180 
cattccacac cgtcagttgc tttaatagct ttctgtgctt ttttcatatt aaatggggag 3240 
gatgacttaa aaagctgccg tttctcatta ggaagcacct tttccacttc gggaaactgc 3300 
accacttttt ccataacctc ttttgaggca tgaacagcaa tcccgttcac cacataataa 3360 
gaatgaattt ggtctgcatt tcctttatct ttctgggtgt tcaagtattt taggacatct 3420 
tgctgggatt catcggctgt gacttttaaa gatgacacaa cagcagaacg cttttgatat 3480 
tccgtcttag cggcagacag cttcttcgat ttcgcttttt taacagccgc ttttgccgct 3540 
ttttctgggt acccctatgt attactagaa aataacatag taaaacggac atcactccgt 
ttcaatggag gtgatgtccg tttttcatta caacaaatta cttatctatt tgtaatgctg 
ctcttggacc cgggatccac gatgtctata caatctgatt cgttcgcaat gagcggcgcc 3720 

3780 
3840 
3900 
3960 
4020 



3600 
3660 



aggcctcctg tcgcaatgac cttgggtcct gttttgcctg ccatttcatt cgcttaacga 
ttccttccac ttggccgaca tagccaaata aaattccaga ttgcatcgcg ctaacagtgt 
tttttccgat aatattgtcg ggccgggtga tttcgatacg aggaagcttt gctgcacgcg 
agtaaagcgc ctctgtcgaa attgtaatcc caggggcaat cgccccgccc atgtattgtt 
tgttttcatc aatatagcag tacgttgtgg cggttccgaa atcgacaaca attaatggat 
tgccgtacaa gtgtatcgca gcgacagcat ttacgattct gtctgcccct acttctttcg 4080 
gattgtcata ttttatattt aaaccggttt tcatacctgg accaacaatt tgaggctcga 4140 
tatgaaagta ttttgtgcac attctttcta acgcaaacat gattggcggc actactgacg 4200 
aaataataat gccatctatc tgttcaaaca taagcccgga gtgatcaaat aaggagcgca 4260 
aaatcatccc aaactcatct tctgttttat gcctgcttgt ttctatacgc cagtgatatt 4320 
ctaattttcc atcatgatat acaccaagta cagtattggt gttccccaca tcgataacca 4380 
gtaacaacct ctatcaccac ttttgaatag tttctctaga acaggcgggg ttgcccccgc 4440 
ctgtaattaa attattacac accctgtagg gaaagtcaat acctttttgt aaaattttta 4500 
cacagcgtgg atctcttcta gggacacctc tttgtacccc tcaagggaga aatattggcg 4560 
gtactgagca cagttttggt tggtggacag tgaaccatag ctgtcgtcaa tagcctcgag 4 620 
ttatggcagt tggttaaaag gaaacaaaaa gaccgttttc acacaaaacg gtctttttcg 4680 
atttcttttt acagtcacag ccacttttgc aaaaaccgga cagcttcatg ccttataact 4740 
gctgtttcgg tcgacaagct tcgcgaagcg gccgcaaaat tcactggccg tcgttttaca 4800 
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 4860 
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 4920 
cagcctgaat ggcgaatggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat 4980 
ttcacaccgc atatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca 5040 
gccccgacac ccgccaacac ccgctgacta tgcttgtaaa ccgttttgtg aaaaaatttt 5100 
taaaataaaa aaggggacct ctagggtccc caattaatta gtaatataat ctattaaagg 5160 
tcattcaaaa ggtcatccac cggatcagct tagtaaagcc ctcgctagat tttaatgcgg 5220 
atgttgcgat tacttcgcca actattgcga taacaagaaa aagccagcct ttcatgatat 5280 
atctcccaat ttgtgtaggg cttattatgc acgcttaaaa ataataaaag cagacttgac 5340 
ctgatagttt ggctgtgagc aattatgtgc ttagtgcatc taacgcttga gttaagccgc 5400 
gccgcgaagc ggcgtcggct tgaacgaatt gttagacatt atttgccgac taccttggtg 54 60 
atctcgcctt tcacgtagtg gacaaattct tccaactgat ctgcgcgcga ggccaagcga 5520 
tcttcttctt gtccaagata agcctgtcta gcttcaagta tgacgggctg atactgggcc 5580 
ggcaggcgct ccattgccca gtcggcagcg acatccttcg gcgcgatttt gccggttact 5640 
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gcgctgtacc aaatgcggga caacgtaagc 
ggcggcgagt tccatagcgt taaggtttca 
accggatcaa agagttcctc cgccgctgga 
tttgtcagca agatagccag atcaatgtcg 
atgtcattgc gctgccattc tccaaattgc 
atgatgtcgt cgtgcacaac aatggtgact 
ggggaagccg aagtttccaa aaggtcgttg 
cttacggtca ccgtaaccag caaatcaata 
gcggagccgt acaaatgtac ggccagcaac 
actacctctg atagttgagt cgatacttcg 
tttgttttag ggcgactgcc ctgctgcgta 
cgacccacgg cgtaacgcgc ttgctgcttg 
aaacagtcat aacaagccat gaaaaccgcc 
aaggttctgg accagttgcg tgagcgcata 
caggcttatg tccactgggt tcgtgccttc 
accttgggca gcagcgaagt cgaggcattt 
tcggtctcca cgcatcgtca ggcattggcg 
tgcacggatc tgccctggct tcaggagatc 
gtggtgctga ccccggatga agtggttcgc 
ttgttcgccc agcttctgta tggaacgggc 
gtcaaggatc tggatttcga tcacggcacg 
gatcgggcct tgatgttacc cgagagcttg 
tccggtggat gaccttttga atgaccttta ■ 
ctagaggtcc ccttttttat tttaaaaatt 
gttttgctgc ccgcaaacgg gctgttctgg 
.ccggcttcag gtttgccggc tgaaagcgct 
ccacgggagg cgtcactggc tcccgtgttg 
cctgtttcag gctgtctatg tgtgactgtt 
tttcatgttc tagttgcttt gttttactgg 
gttcatctgt tacattgtcg atctgttcat 
gtaaaagctc tgatgtatct atctttttta 
ttccctttga tatctaacgg tgaacagttg 
cactgataga tacaagagcc ataagaacct 
tctagtgtgg ttcgttgttt ttgcgtgagc 
ctttgcatgt cactcaaaaa ttttgcctca 
gcatcgtgta gtgtttttct tagtccgtta 
ggtattttgt caccattcat ttttatctgg 
tgtctatcta gttcaacttg gaaaatcaac 
accaatttca tattgctgta agtgtttaaa 
ttaagccttt taaactcatg gtagttattt 
aggctaatct ctatatttgc cttgtgagtt 
tcataaatcc tcatagagta tttgttttca 
atgaattttt ttaactggaa aagataaggc 
ttttcgcttg agaacttggc atagtttgtc 
ggattcctga tttccacagt tctcgtcatc 
taagcatttt ccctactgat gttcatcatc 
gtccgttctt tccttgtagg gttttcaatc 
attagcttgg tttcatgctc cgttaagtca 
aaa'acaacta attcagacat acatctcaat 
ttgagatggg ctagtcaatg ataattacta 
aaattctgct agacctttgc tggaaaactt 
gctagacctt tgtgtgtttt ttttgtttat 
aaagaataaa aaaagataaa aagaatagat 
tcagttccgc agtattacaa aaggatgtcg 
ccttaaaacc ctaaaggctt aagtagcacc 
ccttttgtct ccgaccatca ggcacctgag 
tgcgctcacg gctctggcag tgaatggggg 
tcatgcaagg aaactaccca taatacaaga 
tatggcgggt ctgctatgtg gtgctatctg 
tgattttcca gtctgaccac ttcggattat 
cacccagtaa ggcagcggta tcatcaacag 



actacatttc gctcatcgcc agcccagtcg 5700 
tttagcgcct caaatagatc ctgttcagga 5760 
cctaccaagg caacgctatg ttctcttgct 5820 
atcgtggctg gctcgaagat acctgcaaga 5880 
agttcgcgct tagctggata acgccacgga 5940 
tctacagcgc ggagaatctc gctctctcca 6000 
atcaaagctc gccgcgttgt ttcatcaagc 6060 
tcactgtgtg gcttcaggcc gccatccact 6120 
gtcggttcga gatggcgctc gatgacgcca 6180 
gcgatcaccg cttccctcat gatgtttaac 6240 
acatcgttgc tgctccataa catcaaacat 6300 
gatgcccgag gcatagactg taccccaaaa 6360 
actgcgccgt taccaccgct gcgttcggtc 6420 
cgctacttgc attacagctt acgaaccgaa 6480 
atccgtttcc acggtgtgcg tcacccggca 6540 
ctgtcctggc tggcgaacga gcgcaaggtt 6600 
gccttgctgt tcttctacgg caaggtgctg 6660 
ggaagacctc ggccgtcgcg gcgcttgccg 6720 
atcctcggtt ttctggaagg cgagcatcgt 6780 
atgcggatca gtgagggttt gcaactgcgg 6840 
atcatcgtgc gggagggcaa gggctccaag 6900 
gcacccagcc tgcgcgagca ggggaattga 6960 
atagattata ttactaatta attggggacc 7020 
ttttcacaaa acggtttaca agcataacgg 7080 
tgttgctagt ttgttatcag aatcgcagat 7140 
atttcttcca gaattgccat gattttttcc 7200 
tcggcagctt tgattcgata agcagcatcg 7260 
gagctgtaac aagttgtctc aggtgttcaa 7320 
tttcacctgt tctattaggt gttacatgct 7380 
ggtgaacagc tttaaatgca ccaaaaactc 7440 
caccgttttc atctgtgcat atggacagtt 7500 
ttctactttt gtttgttagt cttga'tgctt 7560 
cagatccttc cgtatttagc cagtatgttc 7620 
catgagaacg aaccattgag atcatgctta 7680 
aaactggtga gctgaatttt tgcagttaaa 7740 
cgtaggtagg aatctgatgt aatggttgtt 7800 
ttgttctcaa gttcggttac gagatccatt 7860 
gtatcagtcg ggcggcctcg cttatcaacc 7920 
tctttactta ttggtttcaa aacccattgg 7980 
tcaagcatta acatgaactt aaattcatca 8040 
ttcttttgtg ttagttcttt taataaccac 8100 
aaagacttaa catgttccag attatatttt 8160 
aatatctctt cactaaaaac taattctaat 8220 
cactggaaaa tctcaaagcc tttaaccaaa 8280 
agctctctgg ttgctttagc taatacacca 8340 
tgagcgtatt ggttataagt gaacgatacc 8400 
gtggggttga gtagtgccac acagcataaa 84 60 
tagcgactaa tcgctagttc atttgctttg 8520 
tggtctaggt gattttaatc actataccaa 8580 
gtccttttcc tttgagttgt gggtatctgt 8640 
gtaaattctg ctagaccctc tgtaaattcc 8700 
attcaagtgg ttataattta tagaataaag 8760 
cccagccctg tgtataactc actactttag 8820 
caaacgctgt ttgctcctct acaaaacaga 8880 
ctcgcaagct cgggcaaatc gctgaatatt 8940 
tcgctgtctt tttcgtgaca ttcagttcgc 9000 
taaatggcac tacaggcgcc ttttatggat 9060 
aaagcccgtc acgggcttct cagggcgttt 9120 
actttttgct gttcagcagt tcctgccctc 9180 
cccgtgacag gtcattcaga ctggctaatg 9240 
gcttacccgt cttactgtca ac 9292 
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<210> 73 
<211> 3964 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid, pOTP71 
<400> 73 

ccatcgaatg gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt 
tattttacca ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct 120 
agaaaaggag gaatttaaat gttactggtt atcgatgtgg ggaacaccaa tactgtactt 



60 



180 



ggtgtatatc atgatggaaa attagaatat cactggcgta tagaaacaag caggcataaa 240 



300 



acagaagatg agtttgggat gattttgcgc tccttatttg atcactccgg gcttatgttt 
gaacagatag atggcattat tatttcgtca gtagtgccgc caatcatgtt tgcgttagaa 360 
agaatgtgca caaaatactt tcatatcgag cctcaaattg ttggtccagg tatgaaaacc 420 
ggtttaaata taaaatatga caatccgaaa gaagtagggg cagacagaat cgtaaatgct 
gtcgctgcga tacacttgta cggcaatcca ttaattgttg tcgatttcgg aaccgccaca 
acgtactgct atattgatga aaacaaacaa tacatgggcg gggcgattgc ccctgggatt 
acaatttcga cagaggcgct ttactcgcgt gcagcaaagc ttcctcgtat cgaaatcacc 
cqqcccqaca atattatcgg aaaaaacact gttagcgcga tgcaatctgg aattttattt 720 
^ 780 

840 
900 
960 



480 
540 
600 
660 



ggctatgtcg gccaagtgga aggaatcgtt aagcgaatga aatggcaggc aaaacaggac 
cccaaggtca ttgcgacagg aggcctggcg ccgotcattg cgaacgaatc agattgtata 
gacatcgttg atccattctt aaccctaaaa gggctggaat tgatttatga aagaaaccgc 
gtaggaagtg tataaggatc cctcgaggtc gacctgcagg gggaccatgg tctcagcgct 
fggagccacc cgcagttcga aaaataataa gcttgacctg tgaagtgaaa aatggcgcac 1020 
attgtgcgac attttttttg tctgccgttt accgctactg cgtcacggat ctccacgcgc 108ft 
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 1140, 
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 1200. 
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 12 60; 
facggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 1320^ 
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 1380 
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 1440 
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 1500- 
attttaacaa aatattaacg cttacaattt caggtggcac ttttcgggga aatgtgcgcg 1560 
gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 1620 
aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc 1680 
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa 1740 
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac 1800' 
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga 1860 
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag 1920 
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca 1980 
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca 2040 
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa 2100 
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc 2160 
tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa 2220 
cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattgatag 2280 
actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct 2340 
ggtttattgc tgataaatct ggagccggtg agcgtggctc tcgcggtatc attgcagcac 2400 
tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa 24 60 
ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt 2520 
aggaattaat gatgtctcgt ttagataaaa gtaaagtgat taacagcgca ttagagctgc 2580 
ttaatgaggt cggaatcgaa ggtttaacaa cccgtaaact cgcccagaag ctaggtgtag 2640 
agcagcctac attgtattgg catgtaaaaa ataagcgggc tttgctcgac gccttagcca 2700 
ttgagatgtt agataggcac catactcact tttgcccttt agaaggggaa agctggcaag 2760 
attttttacg taataacgct aaaagtttta gatgtgcttt actaagtcat cgcgatggag 2820 
caaaagtaca tttaggtaca cggcctacag aaaaacagta tgaaactctc gaaaatcaat 2880 
tagccttttt atgccaacaa ggtttttcac tagagaatgc attatatgca ctcagcgcag 2940 
tggggcattt tactttaggt tgcgtattgg aagatcaaga gcatcaagtc gctaaagaag 3000 
aaagggaaac acctactact gatagtatgc cgccattatt acgacaagct atcgaattat 3060 
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ttgatcacca 
tagaaaaaca 
ggtaacttca 
aaatccctta 
gatcttcttg 
cgctaccagc 
ctggcttcag 
accacttcaa 
tggctgctgc 
cggataaggc 
gaacgaccta 
ccgaagggag 
cgagggagct 
tctgacttga 
ccagcaacgc 
gaca 



aggtgcagag 
acttaaatgt 
ctagtttaaa 
acgtgagttt 
agatcctttt 
ggtggtttgt 
cagagcgcag 
gaactctgta 
cagtggcgat 
gcagcggtcg 
caccgaactg 
aaaggcggac 
tccaggggga 
gcgtcgattt 
ggccttttta 



ccagccttct 
gaaagtgggt 
aggatctagg 
tcgttccact 
tttctgcgcg 
ttgccggatc 
ataccaaata 
gcaccgccta 
aagtcgtgtc 
ggctgaacgg 
agatacctac 
aggtatccgg 
aacgcctggt 
ttgtgatgct 
cggttcctgg 



tattcggcct 
cttaaaagca 
tgaagatcct 
gagcgtcaga 
taatctgctg 
aagagctacc 
ctgtccttct 
catacctcgc 
ttaccgggtt 
ggggttcgtg 
agcgtgagct 
taagcggcag 
atctttatag 
cgtcaggggg 
ccttttgctg 



tgaattgatc 
gcataacctt 
ttttgataat 
ccccgtagaa 
cttgcaaaca 
aactcttttt 
agtgtagccg 
tctgctaatc 
ggactcaaga 
cacacagccc 
atgagaaagc 
ggtcggaaca 
tcctgtcggg 
gcggagccta 
gccttttgct 



atatgcggat 3120 
tttccgtgat 3180 
ctcatgacca 3240 
aagatcaaag 3300 
aaaaaaccac 3360 
ccgaaggtaa 3420 
tagttaggcc 3480 
ctgttaccag 3540 
cgatagttac 3600 
agcttggagc 3660 
gccacgcttc 3720 
ggagagcgca 3780 
tttcgccacc 3840 
tggaaaaacg 3900 
cacatgaccc 3960 

3964 



<210> 74 
<211> 3859 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid, pOTP72 



<400> 74 

ccatcgaatg 

tattttacca 

agaaaaggag 

ttgtgcgata 

get aaagaag 

gaagaaaatg 

tttcatttag 

gtaaataatg 

ggcaagcatt 

aaaagegcta 

aaaagcacta 

catttageca 

tttttacccc 

aaaaaagcag 

catggtctca 

tgaaaaatgg 

eggat ctcca 

gcgtgaccgc 

ttctcgccac 

tccgatttag 

gtagtgggcc 

ttaatagtgg 

ttgatttata 

aaaaatttaa 

ggggaaatgt 

cgctcatgag 

gtattcaaca 

ttgctcaccc 

tgggttacat 

aacgttttcc 

ttgaegcegg 

agtactcacc 

gtgetgecat 

gaccgaagga 

gttgggaacc 



gecagatgat 
ctccctatca 
gaatttaaat 
taggcaacac 
atttaaagcg 
aaaaagcect 
aaaccgacta 
gcgtggtggt 
taggagggtg 
aaattttaga 
gagaegctgt 
aaaatcaaaa 
attctgtttg 
ggatactaga 
gcgcttggag 
cgcacattgt 
cgcgccctgt 
tacacttgcc 
gttcgccggc 
tgetttaegg 
atcgccctga 
actcttgttc 
agggattttg 
cgcgaatttt 
gcgcggaacc 
acaataaccc 
tttccgtgtc 
agaaaegctg 
cgaactggat 
aatgatgagc 
gcaagagcaa 
agtcacagaa 
aaccatgagt 
gctaaccgct 
ggagctgaat 



taattcctaa 
gtgatagaga 
gecagctagg 
gegtatccat 
tttgggtatt 
tttgaattgt 
tgtagggctt 
ggatgccggg 
tattttaccc 
gcaacct ttc 
gaattaegge 
aatctatctt 
caaggagegt 
atgcaaataa 
ccacccgcag 
gcgacatttt 
ageggegcat 
agcgccctag 
tttccccgtc 
cacctcgacc 
tagacggttt 
caaactggaa 
ccgatttegg 
aacaaaatat 
cctatttgtt 
tgataaatgc 
gcccttattc 
gtgaaagtaa 
ctcaacagcg 
acttttaaag 
ctcggtcgcc 
aagcatctta 
gataacactg 
tttttgeaca 
gaagecatae 



tttttgttga 
aaagtgaaat 
caatct ttta 
tttgcacaaa 
caaaaggaaa 
taccctaacg 
gggatagacc 
agtgcgatta 
ggtttagccc 
aaggecttag 
atggttttga 
tgtgggggcg 
ttggtttttg 
ggatccctcg 
ttcgaaaaat 
ttttgtctgc 
taagegegge 
cgcccgctcc 
aagctctaaa 
ccaaaaaact 
ttcgcccttt 
caacactcaa 
cctattggtt 
taaegcttae 
tatttttcta 
ttcaataata 
ecttttttge 
aagatgctga 
gtaagatcct 
ttctgctatg 
gcatacacta 
eggatggcat 
cggccaactt 
acatggggga 
caaacgacga 



cactctat ca 
gaatagttcg 
cagatt tgaa 
actatcagct 
ttttttacat 
ctaaaaat a t 
ggcaaatggc 
cgatagatt t 
aatatattca 
attctt taga 
gcgtcattgc 
atgcgaagta 
acgggatgga 
aggtcgacct 
aataagcttg 
cgtttaccgc 
gggtgtggtg 
tttegcttte 
tegggggetc 
tgattagggt 
gacgttggag 
ccctatctcg 
aaaaaatgag 
aatttcaggt 
aatacattca 
ttgaaaaagg 
ggcattttgc 
agatcagttg 
tgagagtttt 
tggcgcggta 
ttctcagaat 
gacagtaaga 
acttctgaca 
tcatgtaact 
gcgtgacacc 



ttgatagagt 

acaaaaatct 

aaacctggtt 

ctttt caagc 

tagcgtgaat 

tgcagggttt 

gtgtctggcg 

aatcaaagag ■ 

tgcgtataaa 

agttttacct 

ttgtatccag 

tttgagcgcg 

aategctett 

gcagggggac 

acctgtgaag 

tactgegtea 

gttacgegea 

t tcccttcct 

cctttagggt 

gatggttcac 

tccacgttct 

gtctattctt 

ctgatttaac 

ggcacttttc 

aatatgtatc 

aagagtatga 

cttcctgttt 

ggtgcacgag 

cgccccgaag 

ttatcccgta 

gacttggttg 

gaattatgea 

aegateggag 

cgccttgatc 

aegatgectg 
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tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 2160 
ggcaacaatt gatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 2220 
cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt ggctctcgcg 2280 
gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 2340 
cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 24 00 
tgattaagca ttggtaggaa ttaatgatgt ctcgtttaga taaaagtaaa gtgattaaca 24 60 
gcgcattaga gctgcttaat gaggtcggaa tcgaaggttt aacaacccgt aaactcgccc 2520 
agaagctagg tgtagagcag cctacattgt attggcatgt aaaaaataag cgggctttgc 2580 
tcgacgcctt agccattgag atgttagata ggcaccatac tcacttttgc cctttagaag 2640 
gggaaagctg gcaagatttt ttacgtaata acgctaaaag ttttagatgt gctttactaa 2700 
gtcatcgcga tggagcaaaa gtacatttag gtacacggcc tacagaaaaa cagtatgaaa 2760 
ctctcgaaaa tcaattagcc tttttatgcc aacaaggttt ttcactagag aatgcattat 2820 
atgcactcag cgcagtgggg cattttactt taggttgcgt attggaagat caagagcatc 2880 
aagtcgctaa agaagaaagg gaaacaccta ctactgatag tatgccgcca ttattacgac 2940 
aagctatcga attatttgat caccaaggtg cagagccagc cttcttattc ggccttgaat 3000 
tgatcatatg cggattagaa aaacaactta aatgtgaaag tgggtcttaa aagcagcata 3060 
acctttttcc gtgatggtaa cttcactagt ttaaaaggat ctaggtgaag atcctttttg 3120 
ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 3180 
tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 3240 
aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 3300 
tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 3360 
agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 3420 
taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 3480 
caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 3540 
agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 3600 
aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 3660 
gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 3720 
tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 3780 
gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 3840 
ttgctcacat gacccgaca 3859 

<210> 75 
<211> 3934 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid, pOTP73 
<400> 75 

ccatcgaatg gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt 60 
tattttacca ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct 120 
agaaaaggag gaatttaaat gattcttgag ctcgactgtg gaaactcgct gatcaagtgg 180 
cgggtcatcg agggggcggc gcggtcggtc gccggtggcc ttgcggagtc cgatgatgcc 240 
ctggtcgaac agttaacgtc gcagcaagcg- ctgccagtgc gagcctgtcg cctggtgagc 300 
gttcgcagcg agcaggaaac ctcgcaactg gtcgcacggt tggagcagct gttcccggtt 360 
tcggcgctgg ttgcatcatc cggcaagcag ttggcgggtg tgcgcaacgg ctatctcgat 
taccagcgcc tggggctcga ccgctggctg gccctcgtcg cggctcatca cctggctaag 
aaggcctgcc tggtcattga tctggggacc gcggtcacct ctgacctggt cgcggcggat 54 0 
ggagtgcatc tggggggcta catatgcccg ggcatgaccc tgatgagaag ccagttgcgc 600 
acccataccc gacgtatccg ctacgacgat gcagaggccc ggcgggcgct tgccagtctc 660 
cagccagggc aggccacggc cgaggcggtt gagcggggtt gtctgctcat gctcaggggg 720 
ttcgttcgtg agcagtacgc catggcgtgc gagctgctcg gtccggattg tgaaatattc 
ctgacgggtg gggatgccga actggttcgc gacgaactgg ctggcgcccg gatcatgccg 
gacctggttt tcgtagggct ggcactggct tgcccgattg agtaaggatc cctcgaggtc 
gacctgcagg gggaccatgg tctcagcgct tggagccacc cgcagttcga aaaataataa 960 
gcttgacctg tgaagtgaaa aatggcgcac attgtgcgac attttttttg tctgccgttt 1020 
accgctactg cgtcacggat ctccacgcgc cctgtagcgg cgcattaagc gcggcgggtg 1080 
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 1140 
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 1200 
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 1260 



420 
480 



780 
840 
900 
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agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 1320 
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 1380 
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 1440 
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 1500 
caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 1560 
attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 1620 
aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 1680 
tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 1740 
agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 1800 
gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 1860 
cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 1920 
agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 1980 
taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 2040 
tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 2100 
taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 2160 
acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 2220 
ttactctagc ttcccggcaa caattgatag actggatgga ggcggataaa gttgcaggac 2280 
cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 2340 
agcgtggctc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 2400 
tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 24 60 
agataggtgc ctcactgatt aagcattggt aggaattaat gatgtctcgt ttagataaaa 2520 
gtaaagtgat taacagcgca ttagagctgc ttaatgaggt cggaatcgaa ggtttaacaa 2580 
cccgtaaact cgcccagaag ctaggtgtag agcagcctac attgtattgg catgtaaaaa 2640 
ataagcgggc tttgctcgac gccttagcca ttgagatgtt agataggcac catactcact 2700 
tttgcccttt agaaggggaa agctggcaag attttttacg taataacgct aaaagtttta 2760 
gatgtgcttt actaagtcat cgcgatggag caaaagtaca tttaggtaca cggcctacag 2820 
aaaaacagta tgaaactctc gaaaatcaat tagccttttt atgccaacaa ggtttttcac 2880 
tagagaatgc attatatgca ctcagcgcag tggggcattt tactttaggt tgcgtattgg 2940 
aagatcaaga gcatcaagtc gctaaagaag aaagggaaac acctactact gatagtatgc 3000 
cgccattatt acgacaagct atcgaattat ttgatcacca aggtgcagag ccagccttct 3060 
tattcggcct tgaattgatc atatgcggai tagaaaaaca acttaaatgt gaaagtgggt 3120 
cttaaaagca gcataacctt tttccgtgat ggtaacttca ctagtttaaa aggatctagg 3180 
tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt tcgttccact 3240 
gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg 3300 
taatctgctg cttgcaaaca aaaaaaccac cgc.taccagc ggtggtttgt ttgccggatc 3360 
aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata 3420 
ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta 3480 
catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc 3540 
ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg 3600 
ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac 3660 
agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg 3720 
taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt 3780 
atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct 3840 
cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg 3900 
ccttttgctg gccttttgct cacatgaccc gaca 3934 



SEQ ID NO:69 Pseudomonas syringae pv tomato coaX coding sequence 

atgattcttgagctcgattgcggcaacagctttatcaagtggcggataatcacaaagagttgctcaacgttggtcag 
cggcggagtagtggactcggacacagccttgctagagtgcctgggcaatctgtcaggcgcagcattcagcgattgcc 
gtctggtaagcgttcgtagcgcggaagaaacggcgaagctggtttgcgcgctggcagataccttttccattagccct 
gtctgtgcagcgccggcgccagagcttgccggggtaatcaatggatacgacgattttgcacgcttggggctggatcg 
ctggttggcatttgtaggggcttaccaccttgttaagggtgcctgcctggtgatcgatctgggcaccgccattacgt 
ctgactttgttgaagcgtcaggaaagcatctgggtggtttcatctgtcctggcatgccactgatgcgcaatcagctg 
cgtacccacacccgtcgcattcgatatgacgatgcagaggctgaaaaagccctggtacgactcgtgcctggccgtgc 
gacggccgaggctgtggagcgaggttgtt etc teat get tcgcggattcgcaatgactcagatcgagatagctcgcg 
aatactggggggacgactttgctattttcgtgacaggaggcgacgctgtcttggttgctgatgtgttaccgggcgct 
cgcattgtccctgatttggtattcgttggcctggctctcgcttgccctttacgttga 

SEQ ID NO:70 Pseudomonas syringae pv tomato coaX protein sequence 
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MILELDCGNSFIKWRIITKSCSTLVSGGVVDSDTALLECLGNLSGAAFSDCRLVSVRSAEETAKLVCALADTFSISP 
VCAAPAPELAGVINGYDDFARLGLDRWLAFVGAYHLVKGACLVIDLGTAITSDFVEASGKHLGGFICPGMPLMRNQL 
RTHTRRIRYDDAEAEKALVRLVPGRATAEAVERGCSLMLRGFAMTQIEIAREYWGDDFAIFVTGGDAVLVADVLPGA 

RI VPDLV FVGLALAC PLR 
SEQ ID NO: 71 - pAN296 

TGCGCCGCTACAGGGCGCGTCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 
ATTACG CCAG CTGG CGAAAGGGGGATGTG CTGCAAGG CGATTAAGTTGGG TAACG C CAGGGTTTT CC CAGTCACGACGTT 
GTAAAACG ACGG C CAGTGAATTGTAAT ACGACTCACT ATAGGG CG AATTGGG C CCG ACGTCg catgc TGGATGAAAAG CC 
GATG ACCG CTTTTCAGGT CTGTCAG CAGCTTTTT C CTG CTGTATATGAAAAGGAATTGTTTTTAACGATGT CAGAAACGG 
CAGGTCACCTTGATGTGTTGG AGG CTGAAGAAGC CAT CACGT CAT ATTGGGAAGGAAATAC CGTATACTTTAAAACAATG 
AAGAGG TGAAATGGGTGAAACATATAGCGGGAAAAAGGATTTGGAT AAC CGG CGCTT CAGG AGGGCTTGGAGAAAG AATC 
G CATACTTATG CG CGG CTGAAGGAG CC CATG T C CTG CTGTCGG CTAGACG CGAGG ATCGTTTGATAGAAAT CAAAAGG AA 
AATAAC CGAGGAATGG AG CGGACAGTG TGAGATTTTT CCT CTGGATGT CGG C CGC CT AGAGG ATAT CG CCCGGGT C CG CG 
AT CAGATCGG CT CG ATTGATGTAC TG ATT AACAATG CAGG CTTCGGTATATTTGAAACGGTTTTAGACTCT ACATTGGAT 
GACATGAAAGCGATGTTTGATGTGAATGTCTTCGGCCTGATCGCCTGTACAAAAGCGGTGCTTCCGCAAATGCTTGAGCA 

AAAA*^AGGGACAT AT CAT CAATATCG C CT C T CAAG CGGGG AAAATCG C CACAC CGAAGT CT AG CCTGT ATT C CG CGAC CA 
AACATGCCGTGTTAGGTTACTCAAACGCTTTGCGGATGGAGCTTTCGGGAACCGGCATTTATGTGACAACAGTCAACCCG 
GGCCCGATTCAGACGGACTTTTTTTCCATTGCTGATAAAGGCGGGGACTACGCCAAAAATGTCGGCCGCTGGATGCTTGA 
T C C TGATG ACGTGG CAG CT CAAATT ACAG CTG CAATTTTTACGAAAAAG CGGGAGATCAATC TT C CGCGTTT AATGAATG 
C CGG CACTAAG CTGTATCAG CTGTTTC CAG CT CT TG TAGAAAAG CTGG CAGGACG CG CGCT CATGAAAAAATAATGATAG 
AACTGCCTGTGGTGGAGTGGCTTGTTTCTCACGGGGCAGTTTTTGATAGTGGAAGGGAGAGATTGTTGAATGTCAGTTCA 
TTCAGAAGTCCTTCATGCTCTGCTTAAAGATCCGTTTATTCAGAAACTGATTGATGCAGAGCCTGTATTCTGGGCAAATT 

CAGG CAAGAAAG AGGGG C CATT AC C C CGTG CAGATGAGTGGG CAAC CG AGAT AG CGGAAG CGGAAAAAAGAATG CAG CGG 
TTTGCACCTTACATTGCCGAGGTGTTTCCTGAGACGAAAGGCGCTAAAGGAATCATCGAGTCTCCGCTTTTTGAGGTGCA 
GCATATGAAGGGAAAGCTGGAAGCGGCATATCAGCAGCCATTTCCCGGAAGATGGCTTTTAAAGTGCGACCATGAGCTTC 
CGATTT CAGG ATCGATTAAAGCG AGGGG CGGGATTTATGAAGTGTTAAAGTATGCTGAAAATCTCG CGCTT CAAG AAGG A 
ATGCTTCAGGAAACCGATGATTACCGCATCTTACAGGAAGAGCGGTTTACCGGGTTTTTCTCCCGCTATTCGATTGCTGT . 
CGGTTCGACAGGAAATCTAGGTTTAAGCATCGGCATCATCGGCGCGGCACTCGGGTTTCGCGTGACAGTGCATATGTCCG- 

C CG ATG C TAAG CAG TGGAAAAAGG AT C T C C T C CG C CAAAAGGG AG T CAC TGTT ATGGAGT ACG AAACAG AT T ACAG TG AA 
GCGGTGAACGAAGGGAGACGGCAGGCGGAACAAGATCCATTCTGTTATTTTATTGATGATGAACATTCTCGTCAGCTGTT ^ 

CTT AGG AT ATG CTGTTGCTG CAAG CCG ATTAAAAAC AC AG CTTGACTGTATGAATATAAAG C CAAGT C TTG AGACG C C C T 
TGTTTGTGTATCTGCCGTGCGGAGTCGGCGGAGGACCGGGCGGTGTAGCATTTGGGCTGAAGCTTTTATACGGAGATGAT 
GTTCATGTGTTTTTCGCAGAACCAACTCATTCACCTTGTATGCTGTTAGGGCTTTATTCAGGACTTCACGAGAAGATCTC 
CGTC CAGG AT AT CGG C CTGG AT AATCAGACGG CTG CTG ACGG ACTTGCCGTAGGGAGGCCGTCAGGATTTGT CGG CAAG C 
TGATTGAACCGCTTCTGAGCGGCTGTTATACGGTAGAGGACAATACGCTTTATACTTTGCTTCATATGCTGGCTGTATCT V 
GAAGATAAATATTTAGAGC CCT CTG CTC TTG CTGG CATGTTCGGGC CGG TT CAG CTTTTTTCGACAGAAGAGGGAAGGCG J • 

CTATG CT CAGAAATAT AAGATGGAAC ATG C CGTACATG T CG T CTGGGGAACGGG AGG AAG CATGGTTC CAAAAGATGAAA ^ • 
TGGCTGCGTATAACCGAATCGGTGCTGATTTGCTAAAAAAACGAAATGGAAAATAAGCAGACAGTGAAAAGGTTTTCCGT 
TACAATCTTTGTAAGGGTTTTAACCTACAGAGAGTCAGGTGTAAACAGTGAAAAATAAAGAACTTAACCTACATACTTTA „ v 

T ATACACAG CACAAT CGGG AGT CTT ctgcagctcgag c AATAGT TAC CCT TAT TAT CAAGAT AAGAAAGAAAAGGATTTT 
TCGCTACGCTCAAATCCTTTAAAAAAACACAAAAGACC^CATTTTTTAATGTGGTCTTTATTCTTCAACTAAAGCACCC^ 
TT AGTTCAACAAACGAAAATTGG ATAAAGTGGGAT ATTTTT AAAATAT ATATTTATGTTACAGTAATAT TG ACTT T T AAA 
AAAGGATTGATT C TAATG AAG AAAG CAGACAAGT AAG C CT C CT AAAT T CAC T TTAG ATAAAAATTTAGGAGG CAT AT CAA 
ATG AACTT TAATAAAATTGATTT AGACAATTGGAAG AG AAAAG AGATATT TAAT CATT ATTTG AACCAACAAACGACTTT 
TAGT ATAAC CACAGAAATTG ATATTAGTGTTTTATAC CGAAACATAAAACAAGAAGG ATATAAATTTTAC C CTG CATTT A 
TTTT CTT AGTGACAAGGG TGATAAACT CAAATACAG CTTTTAG AAC TGG TT ACAATAG CGACGGAG AG TTAGG TT AT TGG 
GATAAGTTAGAGCCACTTTATACAATTTTTGATGGTGTATCTAAAACATTCTCTGGTATTTGGACTCCTGTAAAGAATGA 
CTTCAAAGAGTTTTATGATTTATACCTTTCTGATGTAGAGAAATATAATGGTTCGGGGAAATTGTTTCCCAAAACACCTA 
TACCTGAAAATGCTTTTTCTCTTTCTATTATTCCATGGACTTCATTTACTGGGTTTAACTTAAATATCAATAATAATAGT 

AATTAC CT T CTACCCATTATTACAG CAGG AAAAT TCATT AATAAAGG T AATT CAATATATTTACCG CTATC TTTACAGGT 
ACATCATTCTGTTTGTGATGGTTATCATGCAGGATTGTTTATGAACTCTATTCAGGAATTGTCAGATAGGCCTAATGACT 
GGCTTTTATAAT ATG AGAT AATGCCGACTGTACTTTTTACAGTCGGTTTTCTAATGTCACTAAC CTG CCCCGTTAGTTGA 

AGAAGGT TTTT AT ATTACAGCTg t eg a c TAAGG T CGAGGAAGTGTTGGT AAGGAGGGT ATGAAATGTG CAT CATATTGAA 
CTGTATGTCTCTGATTTGGAGGCGTCTAGGCGGTTTTGGGGCTGGTTCTTAAAAGAACTTGGTTATAAAGAGTATCAAAA 

ATGGAGCTCAGGCAT CAG CTGGAAGAAAGAT CGTTTTTACCTAG TGATTGTG CAGGCGAAAGAGCCATTTCT AGAGC CGG 
AAT AC CAT AG ATG CCGAGTCGGTCTGAACCATCTCG CATTT CATG CTGAATCCAAGCTTCAAGTCGATCAGATGACTGAA 
AAATTGACGGCAAAAGG C TATCGTG TGTTGTACCGAGACAGG CATCCTTTTG CCGG AGGAGACGGG CATTATG CAGTCTT 
TTGTG AGGAT CCAGAC CGGATTAAGG TAG AG CTCGTTG CC CCAAGCTGTTAATCGTGAT CTT CGG ACAGG CTGTTCAG C T 
TTTTCTCAATGCGATC CAG CTGCGCTTTTCGGTTTTTCG CAT ACTTG AAG CCTGT AACAGCCGCAAAGACG ACAG CGG CA 

AAT ATAATAAAT ACAAACAG CTGAAACATCACAT CACCTAT ATT CATGTTCTTCAC CT CATG TTTGCGGGAG AGATTCAT 
TCTCTTCCGTTTTTTATTTAAAGCGGCTTTTCCAGACGGGAACGGTGTTTTGTGGTCTCCATTTTCATTTGCCGATAGGC 
GAACGCTAAAAATGG CAGGC CGAG CAGGGTAATG CCG CTCAGG ACAGAAAAAATATAAATCGG CCGGC CAG CGCCAAACA 
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GGTCTATACATATCCCCCCGACCCAAGGGC CGATGACG TTTCCGAGCTGTGGAAAACCG ATTG CCC CGAAATAAGTG CCT 
TTT AAT CCTGGTTTTG CAATCTGGTCTACAT ACAAAT CCATCAT AG AGAATAAAAG CACTTCG CCGATTGTAAATGTGAT 
GACAATCATCAC^TTGATGGAACACCGTGTGATACGGTGAAAATGG CCATG CTGATG CT AACCATCACATTACCG AG CA 
TCAGAGAACAAAG CGG CGAAAACCG TTTTG CAAAATGGACAATGGGAAATTG CG T CGCCAACACAACG ATTG CG TTT AAT 
GTCAGCATCAGCCCATACAGCTTCGTTCCTVrTGCCGAT 

TTGTGAG TAG CCGAAGGTG CAT AG CGTAATGC CGACCAAAGCAATGGTAAAAAGATAATC CTTTTG CGTGAC CATAAACG 
CTTCCCGCACGCTCATATTTCGGGACTGGGCTGGTGCTGATAAGGATGGATGTTTTTTAAATTGGA^ 
CCGTATAGTC CGTAAATGACTG CAGG CAC CAAAAAGGGCGTAGTCGATTG CGATGAG C CGAAATATAGGCCAAG CACAGG 
T CCGAAGACAACG CCGATATTAATAGCCGCATAG CG T AAATTAAAAACTAG CAGT CTCGTTTTTTCTTCTGTCAT AT CAG 
ACAACAAGGC CTTTGAAGCGGG CT CAAACAGTGATTTG CAAAGACCGTTT AATG CGTTTACTACAAAAAACACC CAGAG A 
TTAGATG CTGCCGCAAAG CCTG CAAATACCAG CATCCATC CGAAAATCG ATACAAG CAT CATGTTTTTTCTG C CGAATTT 
ATCTG AGATAT AT CCGC CGTAAAAG CTTG CGAGG ATGC CGACTG ATGAGCT CG CGG CGATGAC CAGCC CTG CATAGGAAG 
CTGATG CG C CTTGGACGG CTGT CAAATAAAT CG CTAAAAAAGGAATG CT CATCG ATGTTG C CATT CTG CCG AAAATGG TT 
CCGATTATAATTGTAa eg eg t TGGATGCATAGCTTGAGTATTCTATAGTGTCACCTAAATAGCTTGGCGTAATCATGGTC 
AT AG CTGTTT CC TGTGTGAAATTG TT AT CCG CT CACAATT C CACACAACATACGAG C CGGAAG C AT AAAGTGTAAAG CCT 
GGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGC 
CAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGA 
CTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAG 
GGGATAACGC^GGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGT^ 

TT CG AT AGG CT C CG C C C C C CTG ACG AG CATCACAAAAAT CG ACG CT CAAGTCAGAGG TGG CG AAAC CCGACAGGACTATA 
AAGATAC CAGG CG TTT CCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCG 
CCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCC 
AAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCC 

GGTAAGACACGACTT ATCG C CACTGG CAG CAG C CAC TGGT AACAGGATTAG CAGAG CGAGG TATG TAGG CGGTG CTACAG 
AGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACC 

TTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTT^ 

GATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACT 
CACGTTAAGGGATT TTGGT CATGAGATTAT CAAAAAGG AT CTT CAC CT AG AT C CTTTT AAAT T AAAAATG AAGTTTT AAA 
T CAAT C TAAAG T ATAT ATGAG TAAACTTGGT CTG ACAGTT AC CAATG CTTAAT CAGTG AGG CAC C TAT CT CAG CG AT CTG 
-TCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCA 

GTG C TG CAATG ATAC CGCGAG AC C CACG C T CAC CGG CT C CAGAT TT ATCAG CAAT AAAC CAG C CAG C CGG A AGGG C CG AG 

CGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCC 

AGTTAATAGTTTGCGCAACGTTGTTGGCATTGCTACAGG CATCG TGGTGT CACG CTCGTCGTTTGGTATGG CTT CAJ 

GCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCG 

ATCGTTGTC^GAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCC 

ATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATACCGCGCCCGGCGACCGAGTTGCT 

CTTGCCCGGCGTCAATACGGGATAATAGTGTATGACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTJCG 

GGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGC 

AT CTT T TACT T T CAC CAG CGTTT CTGGG TG AG CAAAAACAGGAAGG CAAAATG C CG CAAAAAAGGGAATAAGGG CG ACAC 
GGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATAC 
ATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGTATGCGGTGT 
G AAAT AC CG CACAGATG CG T AAGG AGAAAAT AC CG CATCAGG CGAAATTG TAAACGTT AATAT T TTG TTAAAATTCG CGT 
TAAAT AT TTG T TAAAT CAG C T CATTTTTTAACC AATAGG C CGAAAT CGG CAAAAT C C CTTAT AAAT CAAAAGAATAGAC C 
GAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCC^CTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAA 
AACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCAAATCAAGTTTTTTGCGGTCGAGGTGCCGTAAAGCTC 
TAAATCGG AAC C CTAAAGGGAG C CCC CGATTTAG AG C TTGACGGGGAAAG C CGGCGAACGTGG CGAGAAAGGAAGGG AAG 
AAAG CG AAAGGAG CGGGCG CTAGGG CG C TGG CAAGTGT AG CGGT CACG CTG CG CGTAAC CAC CACAC C CG CCG CG C TT AA 
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TGCGCCGCTACAGGGCGCGTC CATT CGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCT CTT CGCT 
ATTACGCCAGCTGGCGAAAGGGGGATGTG CTG CAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCC CAGT CACGACGTT 

GT AAAACGACGG CCAG TGAATTGTAATACGACT CACTAT AGGG CGAATTGGG C C CG ACGT CG CATG CAC CAGG CTT CT CA 
GGCGCTGACTTAGAAAACCTCTTGAATGAAGCTGCGCTTGTAGCGGCTCGTCAAAACAAGAAAAAAATCGATGCGCGTGA 

T ATTGACGAAGCG ACGGAC CGTGTAATTG C CGGAC CCG CTAAG AAG AGC CG CGT TATCTCCAAG AAAGAACG CAAT ATCG 
TGG CTTATCACGAAGGCGGACACAC CGTTATCGGTCT CGTTTTAG ATGAGG CAG ATATGGTT CATAAAGTAACG ATTGTT 
CCTCGGGGCC^GGCTGGCGGTTATGCTGTTATGCTGCCAAGAG^^ 

TAAAATTGTCGGCCTCTTGGGCGGACGTGTTGCTGAAGAGATTATCTTCGGTGAAGTCAGCACAGGGGCGCACAATGACT 

TCCAGCGTGCGACGAATATTGCAAGACGAATGGTTACAGAATTCGGTATGTCAGAAAAACTGGGACCGT^^ 

CAGT CTCAGGG CGGTCAGGTATTCTTAGG CCGTGATTT CAACAACG AACAGAACTACAGTGATCAAATCG C TTACGAAAT 

TGATCAGGAAATTCAG CG CATCATCAAAGAATGTT ATGAGCGTG CG AAACAAAT C C TGACTGAAAAT CGTGACAAG CT TG 

AATTGATTGC CCAAACG CTTCTGAAAGTTGAAACG CTTGACGCTGAACAAATCAAACACCTTAT CG ATCATGGAACATTA 

CCTG AG CGTAATTT CT CAGATG ATGAAAAG AACGATGATG TGAAAGTAAACATTCTGACAAAAACAGAAGAAAAGAAAGA 

CGATACX3AAAG AGTAATTCGCTTTCTTTCTAAAAAAACTG CCGG CTGACG CTGG CAGTTTTTTTATGTAAATG ATTGG CT 
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CAG CTG CGG CTTTT ACAAT CAT C CAATT CTGG T AT CG ATT TG TTT ACAAATG AG C CG CTG AT CG TG T ATGG TAT TG TAG A 
ATGTTTGTAAAAAGTAAAGTAGAGAAACTATTCAAAAGTGGTGATAGAGGTTGTTACTGGTTATCGATGTGGC^ 
CTG CAGCTCGAGTG AAATACCGCACAGATG CGTAAGGAGAAAATACCG CAT CAGGCGATAAACC CAGCG AACCATTTGAG 
GTGATAGGTAAGATTATACCGAGGTATGAAAACGAGAATTGG ACCTTTACAGAATTACTCTATG AAG CG CCATATTTAAA 
AAG CTACCAAG ACGAAG AGGATG AAGAGGATGAGG AGG CAG ATTG CCTTGAAT AT ATTG ACAATACTGATAAG ATAATAT 
ATCTTTT ATATAGAAGATATCGCCGTATGT AAGGATTTCAGGGGG CAAGG CATAGG CAG CGCGCTTAT CAATATATCTAT 
AGAATGGG CAAAGCATAAAAACTTG CATGGACTAATGCTTGAAACCCAGG ACAATAACCTT ATAGCTTGT AAATTCTAT C 
ATAATTGTGGTTTCAAAATCGGCTCCGTCGATACTATGTTATACGCCAACTTTCAA 

TGGTATTTAAGGTTTT AGAATGCAAGGAACAGTGAATTGG AGTTCG TCTTGTTAT AATTAG CTTCTTGGGGT AT CTTTAA 
AT ACTG T AG AAAAG AGG AAGG AAAT AAT AAATGG C T AAAATG AG AAT AT CAC CGG AAT TG AAAAAAC TG AT CG AAAAAT A 
CCG CTG CGT AAAAGATACGGAAGGAATGTCT CCTGCT AAGGTATATAAG CTGGTGGGAGAAAATGAAAAC CT ATATTT AA 
AAATG ACGG ACAG C CGG T AT AAAGGG AC CA C C T ATG ATG TGG AACGGG AAAAGG ACATG ATG C T ATGG C TGG AAGG AAAG 
CTG C CTGTT CCAAAGGTCCTG CACTTTGAACGG CATGATGG CTGGAG CAATCTG C T CATGAG TGAGG C CGATGG CGT C CT 
TTG CTCGG AAGAGTATGAAGATGAACAAAG C CCTGAAAAGATTATCGAGCTGTATG CGG AGTGCATCAGG CT CTTTCACT 
CCAT CGACAT ATCGGATTGTCC CT AT ACGAATAGCT TAGACAG CCG C TT AG C CGAATTGGATTACTT ACTG AAT AACGAT 
CTGG C CGATG TGGAT TGCGAAAACTGGGAAGAAGACACT C CATTTAAAGATC CG CG CG AG CTG TATG ATTTTTTAAAGAC 
GGAAAAGCC CGAAG AGGAACT TGT CTTTTC C CACGG CG AC CTGGGAGACAG CAACATC T TTGTGAAAGATGGCAAAGTAA 
GTGG CTTT ATTGAT CTTGGGAGAAGCGGCAGGG CGG ACAAGTGG T ATGACATTGC CTT CTG CGT C CGGT CGAT CAGGGAG 
G ATAT CGGGGAAG AACAG TATGTCGAG CT ATTTTTTG ACTTACTGGGG AT CAAG C CTGATTGGG AG AAAATAAAATATT A 
TATTTTACTGGATGAATTGTTTTAGTACCTAGATTTAGATGTCTAAAAAGCTTTAACTACAAGCTTTTTAGACATCTAAT 
CTTTTCTGAAGTACATCCGCAACTGTCCATACTCTGATGTTTTATATCTTTTCTAAAAGTTCGCTAGATAGGGGTCCCGA 
G CG C C T ACGAGGAAT TTGTAT CG C CATT CG C CATT CAGG CTGCG CAACTGTTGGGAAGGG CGAT CGGTG CGG T CGACTGG 
CAGGCAAAACAGG AC C CAAGGT CATTG CG ACAGGAGGC CTGG CG C CGCT CATTG CGAACGAATCAGATTGTATAGACAT C 
GTTG AT C CATT C TTAACC CT AAAAGGG C TGG AATTGAT TTATG AAAGAAAC CG CGT AGG AAGTGTATAGGAGGTTTAGTA 
ATGG ATT ATTT AG TAAAAG CACTTGCGTATGACGGAAAAGTTCGGG C T TATG CAG CGAGAACGACTGAT ATGGTAAATGA 
GGGGCAGAGACGCCATGGTACGTGGCCGACAGCATCCGCTGCACTAGGCCGTACAATGACAGCTTCACTTATGCTCGGCG 
CTATGCTGAAGGGCGATGATAAGCTGACCGTGAAAATCGAGGGCGGAGGTCCGATCGGAGCTATTGTAGCTGATGCCAAT 

G C CAAAGGAG AAGTCAG AG C C TATGT CT CTAAC C CG CAAGTT CATTTTGATTTAAATGAACAAGGT AAG CTTG ATGT CAG 
ACGTGCGGTTGGAACAAACGGAACGTTAAGTGTCGTAAAAGATTTAGGTTTGCGCGAGTTCTTCACAGGACAAGTAGAAA 
TCGTTTCAGGAGAATTAGGAGATGATTTTACTTACTATCTTGTGTCATCTGAGCAGGTTCCTTCATCAGTGGGCGTAGGT 
GTGCTCGTAAATCCTGACAATACCATTCTTGCGGCAGGGGGCTTTATTATTCAGCTGATGCCGGGAACAGATGATGAAAC 
AAT CACAAAAAT TGAACAG CG T C TAT C T CAAGT AG AG C CG ATTT CT AAGCTCAT C CAAAAAGGG C TG ACAC C AGAAG AAA 
T T TT AG AAGAAGT C C T AGG CGAG AAAC CTG AGATTT TGGAAACGATG C CTGT CAGATT C CATTG C C CTTG TT CAAAAGAA 
CGGT TCG AAAC AG C CATT TTAGGACTAGG CAAAAAAG AAATTCAAGATATGATAG AAG AAG ATGGACAAG C CGAAG CAGT 
ATG CCATTTTTGTAATGAAAAGTACTTATTTACAAAAGAAGAGCTGGAAGGGCTTCGTGACCAAACTACC CGCT AAGCTC 
TT T AG CGGGTTTTTAATTTG AG AAAAGGGG CTG AAAG CAGGTTTG AAAT CAAG AACAAT CTGG ACG CGT TGGATGCAT AG 
CTTGAGTATTCTATAGTGTCACCTAAATAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGC 
T CACAAT T C CACACAACATACG AG CCGGAAG CAT AAAGTGTAAAGC CTGGGGTG CCTAATGAGTG AG CTAACT CAC ATTA 
ATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGG 
GAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAG 
CGGT AT CAG CT CACT CAAAGG CGGT AAT ACGGTT AT CCACAGAAT CAGGGG ATAACG CAGGAAAG AACATG TGAGCAAAA 
GGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCGATAGGCTCCGCCCCCCTGACGAGCATCA 

CAAAAATCGACG CT CAAGT CAGAGGTGG CGAAAC CCGACAGG ACTATAAAG ATACCAGG CGTTT C C C C CTGG AAG CT C CC 
TCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCT 
CATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCA 
GCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAG 

C CACTGGTAACAGG ATTAG CAG AG CGAGG TATGT AGG CGG TGCTACAGAGT T C TTGAAGTGG TGG C C TAACTACGG CTAC 
ACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGG 
CAAAC^U^CCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAG 
ATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCA 

AAAAGGAT C TT CAC C TAGAT C CTTTTAAATTAAAAATGAAGTTTTAAAT CAAT CTAAAGTATATATGAG TAAACTTGGTC 
TGACAGTTACCAATGCTTAATCAGTGAGG CAC CTATCT CAG CGAT CTG TCTATTTCGTT CATC CAT AGTTGCCTGACTCC 
CCGTCGTGTAG ATAACTACGATACGGGAGGG CTTAC CATCTGG CC C CAGTG CTGCAATG ATAC CG CGAGAC CCACG CT CA 
CCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTC 
CAT C CAGT CT ATT AATTGTTG CCGGG AAG CTAGAGTAAGTAGTT CG C CAGTT AATAGTTTGCG CAACG TTGTTGG CATTG 
CTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTT(^TTCAGCTCCGGTTCCC^CGATCAAGGCGAGTTACA 
TGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTT 
ATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGT 
ACT CAACCAAGT CATTCTGAG AATACCG CG C C CGG CG AC CGAGTTGCTCTTG C CCGG CG T CAAT ACGGGAT AAT AGTG TA 
TG ACATAG CAGAACTTTAAAAG TG CTCATCATTGGAAAACGTT CTTCGGGG CGAAAACTCTCAAGGAT C TTACCG CTG TT 
GAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCT^ 
CAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATC 

CAATATT ATTG AAG CATTTAT CAGGGTTATTGTCT CATGAG CGGATACATATTTGAATGTAT TT AG AAAAAT AAACAAAT 
AGGGGTTCCG CG CACATTTCCC CGAAAAG TG CCAC CTGTATG CGGTG TGAAATACCG CACAGATG CGTAAGGAG AAAATA 
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C CG CAT CAGG CG AAATTG T AAACG TT AAT ATTTTG TT AAAATT CG CG TT AAAT ATTTG TT AAAT CAG CT CATTTTTT AAC 
CAAT AGG C CG AAAT CGG CAAAAT C C CTT AT AAAT CAAAAG AAT AG AC CG AG AT AGGG TTG AG TG TTG T T C CAG TTTGG AA 
CAAG AG T C CAC T ATT AAAG AACG TGG AC T C CAACG T CAAAGGG CG AAAAAC CG T CT AT CAGGG CG ATGG C C CAC T ACG TG 
AACCATCACCGAAATCAAGTTTTTTGCGGTCGAGGTGCCG 

AG AG CTTG ACGGGG AAAG C CGG CG AACG TGG CG AG AAAGG AAGGG AAG AAAG CG AAAGG AG CGGG CG CT AGGG CG CTGG C 
AAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAA 
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TTGCGGCCGCTTCGAACTGTTATAAAAAAAGGATC^TTTTGAACTCTCTCCCAAAGTTGATCCCTTAAC^ 

T CCCTTTGAGAATGTTT ATATACATTCAAGGTAACCAG C CAACTAATG ACAATG ATT C CTGAAAAAAGTAATAACAAATT 

ACTATACAG AT AAG TTGACTGAT CAACTT C CATAGGTAACAACCTTTGAT CAAGTAAGGGTATGGATAATAAACCAC CTA 

CAATTGCAATACCTGTTCCCTCTGATAAAAAGCTGGTAAAGTTAAGCAAACTCATTCG 

AAG CTACTTGAAACAATTGTTGAT ATAACTGTTTTGG TG AACGAAAG CC CAC CTAAAACAAAT ACGATTAT AATTGT CAT 
GAAC CATG ATGTTGTTT CTAAAAGAAAGGAAG CAGTTAAAAAG CT AACAGAAAGAAATGT AACT C CG ATGTTTAACACGT 
AT AAAGGACCT CTT CTAT CAACAAGTAT C C CACCAATGT AG C CGAAAAT AATGACAC T CATTGT T C CAGGG AAAATAATT 
ACACTT C CGATTT CGG CAGTACTTAG CTGGTGAACAT CTT T CAT CATATAAGGAACCATAG AGACAAAC C C TG CTACTGT 
TC CAAAT ATAATT C C CCCACAAAGAACT C CAATCATAAAAGGTATATTTTTC CCT AAT C CGGGAT CAACAAAAGGATC TG 
TTACTTTCCTGATATGTTTTACAAATATCAGGAATGACAGCACGCTAACGATAAGAAAAGAAATGCTATATGATGTTGTA 
AACAACATAAAAAAT ACAATG CCTACAGAC^TTAGTATAATTCCTTTGAT AT CAAAATG AC CTTTTATC CTT ACTT CTT 

CTTT AATAATT T CATAAGAAACGGAACAGTG ATAATTGTT AT CATAGG AATGAGTAGAAGAT AGG AC CAATGAATATAAT 
GGGCTATC^TTCC^CC^TCGCTGGACCGACTCCTTCTCCCATGGCTACTATCGATCC^TAAGACCAAATGCTTTACCC 
CTATTTTCCTTTGGAATATAGCGCGCAACTACAACCATTACGAGTGCTGGAAATGCAGCTGCACCAGCCCCTTGAATAAA 
ACGAGC CAT AAT AAGTAAGGAAAAG AAAG AATGG CCAACAAAC C CAATTAC CGAC C CG AAACAATTT ATTATAATTC CAA 
ATAGGAGT AAC CTTTTGATG C CTAATTG AT CAGAT AG CT TTC CAT ATACAG CTGTT C CAATGGAAAAGG TTAACATAAAG 
GCTGTGTTC^CCCAGTTTGTACTCGCAGGTGGTTTATTAAAATCATTTGCAATATCAGGTAATGAGACGTTCAAAACCAT 
TTCATTTAATACGCTAAAAAAAGATAAAATGCAAAGCCAAATTAAAATTTGGTTGTGTCGTAAATTCGATTGTGAATAGG 
ATGTATTCACATTTC^CCCTCC^TAATGAGGGCAGACGTAGTTTATAGGGTTAATGATACGCTTCCCTCTTTTAATTGA 
ACCCTGTTACATTCATTACACTTCATAATTAATTCCTCCTAAACTTGATTAAAACATTTTACCACATATAAACTAAGTTT , 
TAAATTCAGTATTTCATCACTTATACAAGAATATGGCCCGTTTGTTGAACTACTCTTTAATAAAATAATTTTTCCGTTCC '( 
CAATTCC^CATTGCAATAATAGAAAATCCATCTTCATCGGCTTTTTCGTCATC^TCTGTATGAATCA7^TCGCCTTCTTC 
TGTGTCATCAAGGTTTAATTTTTTATGTATTTCTTTTAACAAACCACCATAGGAGATTAACCTTTTACGGTGTAAACCTT > ; 
CCTCCAAATCAGACAAACGTTTCAAATTCTTTTCTTCATCATCGGTCATAAAATCCGTATCCTTTACAGGATATTTTGCA £ 
GTTTCGTCAATTGCCGATTGTATATCCGATTTATATTTATTTTTCGGTCGAATCATTTGAACTTTTACATTTGGATCATA £ 
GTCTAATTTCATTGCCTTTTTCGAAAATTGAATCCATTX3TTTTTGATTCACGTAGTTTTCTGTATTCTTAAJyVTAAGTT *v 
GTTCCACACATACCAATACATGCATGTGCTGATTATAAGAATTATCTTTATTATTTATTGTCACTTCCGTTGCACGCAT^ 
AAACCAACAAGATTTTTATTAATTTTTTTATATTGCATCATTCGGCGAAATCCTTGAGCCATATCTGACAAACTCTTATT 
TAATTCTTCGCC^T<^TAAAC^TTTTTAACTGTTAATGTGAGAAAC^CC^CGAACTGTTGGCTTTTGTTTAATAACTT ^ 

CAGCAAGAACCTTTTGTGACTGAATGCCATGTTTCATTGCTCTCCTCCAGTTGCAC ' t 
AACCAC^CTCGATACAACTTTCTTTCGCCTGTTTCACGATTTTGTTTATACTCTAATATTTCAGCACAATCTTTTACTCT 
TTCAGCCTTTTTAAATTC^GAATATGCAGAAGTTCAAAGTAATCAACATTAGCGATTTTCTTTTCTCTCCATGGGGAAT . 
TGGAATTCTCAGTCGCTCCAGTTGCAAACGATTCTGGATAGTTTGCCGGATTTGCGATAGAACCAGGCCCGCCGGGAATA 

AAGAGATC CGTATT C C C CG C TGAAAACT CAGGG AAAATATCGG CCGAACG C CAGG CATTGAC CATG T CT C TGTAC CAT TC 
ATCAAGTCCAGAGCCCCCTCCCCATGAGTTATTGACAACATCAGGAGCCATTTCCGGGTGGGGATTTCCTTCCGCGTCCT 

TTGGTGCTAAAACCCATTCACC^GCTTCC^AAATGTCAGC^^ 

CATTTTGCGCC^GGTGCTAC^CCGATTTGATTTGTTCC^TCAGGTTC^GAGCCCACCATCGTGCCTGTC^CGTGGGTTCC 

ATGAG CCAAAT CAT CAT AAGGG CTTG CCT CGCCTG CTACGG CAT CAT AC CAGT T CAT T T CATTTT CAGG CT CATTAGGAT 
TTTCCGGATTATATCCGCGATATTTCTCTTTTAATGCCGGATGATTCCATTCCACCCCGGTATGAATGGACGC^ACAACC 

GTGCCAGTTCCATCATATCCAAGTGCCCAAGCTTTTG 

TTTAATAGCTTTCTGTGCTTTTTTCATATTAAATGGGGAGGATGACTTAAAAAGCTGCCGTTTCTCATTAGGAAGCACCT 
TTTCCACTTCGGGAAACTGCACGACTTTTTCCATAACCTCTTTTGAGGCATGT^ 

GAATGAATTTGGTCTGCATTTCCTTTATCTTTCTGGGTGTTCAAGTATTTTAGGACATCTTGCTGGGATTCATCGGCTGT 
GACTTTT AAAG ATGACACAACAG CAG AACG CTTTTG AT ATT CCGT CTT AG CGG CAGACAGCTTCTTCGATTTCG CTTT TT 
TAACAGCCGCTTTTGCCGCTTTTTCTGGGTACCCCTATGTATTACTAGAAAATAACATAGTAAAACGGACATCACTCCGT 
TTCAATGGAGGTGATGTCCGTTTTTCATTACAACAAATTACTTATCTATTTGTAATGCTGCTCTTGGACCCGGGATCCGC 
AAGTGCTTTTACTAAATAATCCATTACTAAACCTCCTATACACTTCCTACGCGGTTTCTTTCATAAATCAATTCCAGCCC 
TTTTAGGGTTAAGAATGGAT CAACGATGTCTATACAATC TGATT CGTT CG CAATGAG CGG CG CCAGG C CTC CTG TCG CAA 
TGACCTTGGGTCCTGTTTTGCCTGCCATTTCATTCGCTTAACGATTCCTTCC^^ 

CAGATTGCATCGCGCTAACT^GTGTTTTTTCCGATAATATTGTCGGGCCGGGTGATTTCGATACGAGGAAGCTTTGCTG^ 
CGCGAGTAAAGCGCCTCTGTCGAAATTGTAATCCGAGGGGCAATCGCCCCGCCCATGTATTGTTTGTTTTCATCAATATA 

G CAGTACG TTGTGG CGGTTCCGAAAT CGACAACAATT AATGGATTGCCG TACAAGTG TAT CGCAG CG ACAG CATTTACGA 
TTCTGTCTGCCCCTACTTCTTTCGGATTGTGATATTTTATATTTAAACCGGTTTTCATACCTGGACCAACAATTTGAGGC 

TCGATATGAAAGTATTTTGTGGACATTCTTTCTAACGCAAACATGATTGGCGGC^ 
TATCTGTTCAAACATAAGCCCGGAGTGATCAAATAAGGAGCGCAAAATCATCCCAAA 
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TTG TTT CT AT ACG C CAG TG AT ATT CT AATTTT C CAT CATG AT AT ACAC CAAG T ACAG T ATTGG TG TTC C C CACAT CG AT A 
ACCAG TAACAACCT CTATCAC CACTTTTGAATAGTTTCTCTAGAACAGGCGGGGTTGCCC CCG CCTGT AATTAAATTATT 
ACACACCCTGTAGGGAAAGTCAATACCTTTTTGTAAAATTTTTACACAGCGTGGATCTCTTCTAGGGACACCTCTTTGTA 
CC CCTCAAGGGAGAAAT ATTGG CGGTACTGAG CACAGTTTTGGTTGG TGG ACAGTG AACCATAG CTGT CGT CAATAG CCT 
CGAGTTATGGCAGTTGGTTAAAAGGAAACAAAAAGACCGTTTTCACACAAAACGGTCTTTTTCGATTTCTTTTTACAGT 

ACAGCCACTTTTGCAAAAACCGGACAGCTTCATGCCTTATAACTCCTGTTTCGGTC^ 
AAATTCACTGGCCGTCGTTTTACAACGTCX3TGACTGGGAAAACCCTGGCG 

CCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAA 

TGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTC 

CTCTGATG C CG CATAGTTAAG CCAG CC C CGACAC CCG CCAACACC CG CTG ACTATG CTTGTAAAC CGTTTTGTGAAAAAA 

TTTTTAAAATAAAAAAGGGGACCTCTAGGGTCCCCAATTAATTAGTAATATAATCTATTAAAGGTCATTC 

CCACCGGATCAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAA 

GAAAAAGCCAGC CTTTCATGATATAT CTCCCAATTTGTGTAGGG CTTATTATGCACG CTTAAAAATAATAAAAG CAGACT 

TGACCTGAT AGTTTGG CTG TGAG CAATTATGTGCTTAGTG CATCTAACG CTTGAGTT AAG C CG CG CCG CGAAG CGG CGT C 

GGCTTGAACGAATTGTTAGACATTATTTGCCGACTACCTTGGTGATCTCGCCTTTCACGTAGTGGACAAATTCTTCCAAC 

TGATCTGCG CG CGAGG CCAAG CGATCTTCTT CTTGT C CAAGATAAG CCTGTC TAG CTT CAAGT ATGACGGGCTGATACTG 

GGCCGGCAGGCGCTCCATTGCCCAGTCGGCAGCGACATCCTTCGGCGCGATTTTGCCGGTTACTGCGCTGTACCAAATGC 

GGGACAACGTAAG CACTACAT TT CG C T CAT CG CCAG C C CAGT CGGG CGG CGAGTTCCAT AG CG TT AAGGTTT CATTTAG C 

GCCTCAAATAGATCCTGTTCAGGAACCGGATCAAAGAGTTCCTCCGCCGCTGGACCTACCAAGGCAACGCTATGTTCTCT 

TGCTTTTGTCAGCAAGATAGCCAGATCAATGTCGATCGTGGCTGGCTCGAAGATACCTGCAAGAATGTCATTGCGCTGCC 

ATTCTCCAAATTGCAGTTCGCGCTTAGCTGGATAACGCCACGGAATGATGTCGTCGTGCACAACAATGGTGACTTCTACA 

GCGCGGAGAATCTCGCTCTCTCCAGGGGAAGCCGAAGTTTCCAAAAGGTCGTTGATCAAAGCTCGCCGCGTTG TTT CATC 

AAGCCTTACGGTCACCGTAACCAGCAAATCAATATCACTGTGTGGCTTCAGGCCGCCATCCACTGCGGAGCCGTACAAAT 

GTACGGCCAGCAACGTCGGTTCGAGATGGCGCTCGATGACGCCAACTACCTCTGATAGTTGAGTCGATACTTCGGCGATC 

ACCGCTTCCCTCATGATGTTTAACTTTGTTTTAGGGCGACTGCCCTGCTGCGTAACATCGTTGCTGCTCCATAACATCAA 

ACAT CGAC C CACGG CG TAACG CG CTTGC TG C TTGG ATG C CCG AGG CATAGACTG TAC CC CAAAAAAACAG T CAT AACAAG 

CCATGAAAACCGCCACTGCGCCGTTACC^CCGCTGCGTTTOGTC^^GGTTCTGGACCAGTTGCGTGAGCGCATACGCTAC 

TTGCATTACAGCTTACGAACCGAACAGGCTTATGTCCACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCC 

GGGAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGC^ 

GTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGA 
CCT CGG C CGT CG CGG CGC TTG C CGG TGG TG CTG AC C CCGG ATG AAG TGGT T CGCAT CCT CGG T TT T CTGG AAGG CG AG CA 
TCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATT 
TCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCC 
AGCCTGCGCGAGCAGGGGAATTGATCCGGTGGATGACCTTTTGAATGACCTTTAATAGATTATATTACTAATTAATTGGG 
GACCCTAGAGGTCCCCTTTTTTATTTTAAAAATTTTTTCACAAAACGGTTTACAAGCATAACGGGTTTTGCTGCCCGCAA 
ACGGGCTGTTCTGGTGTTGCTAGTTTGTTATCAGAATCGCAGATCCGGCTTCAGGTTTGCCGGCTGAAAGCGCTATTTCT 
TCC^GAATTGCC^TGATTTTTTCCCCACGGGAGGCGTCACTGGCTCCCGTGTTGTCGGCAGCTTTGATTCGATAAGCAGC 
ATCGCCTGTTTCAGGCTGTCTATGTGTGACTGTTGAGCTGTAACAAGTTGTCTCAGGTGTTCAATTTCATGTTCTAGTTG 
CTTTGTTTTACTGGTTTCACCTGTTCTATTAGGTGTTACATGCTGTTCATCTGTTACATTGTCGATCTGTTCATGGTGAA 
CAGCTTTAAATGCACCAAAAACTCGTAAAAGCTCTGATGTATCTATCTTTTTTACACCGTTTTCATCTGTGCATATGGAC 
AGTTTTCCCTTTGATATCTAACGGTGAACAGTTGTTCTACTTTTGTTTGTTAGTCTTGATGCTTCACTGATAGATACAAG 
AGCCATAAGAACCTCAGATCCTTCCGTATTTAGCCAGTATGTTCTCTAGTGTGGTTCGTTGTTTTTGCGTGAGCCATGAG 
AACGAACCATTGAGATCATGCTTACTTTGCATGTGACTO^AAAATTTTGCCTCAAAACTGGTGAGCTGAATTTTTGCAGT 
T/^AAGCATCGTGTAGTGTTTTTCTTAGTCCGTTACGTAGGTAGGAATCTGATGTAATGGTTGTTGGTATTTTGTCACCAT 
TCATTTTTATCTGGTTGTTCTCAAGTTCGGTTACGAGATCCATTTGTCTATCTAGTTCAACTTGGAAAATCAACGTATCA 
GTCGGGCGGCCTCGCTTATCAACCACCAATTTCATATTGCTGTAAGTGTTTAAATCTTTACTTATTGGTTTCAAAACCCA 
T TGGTTAAG C CTTTTAAAC TCATGG TAG TTATTTT CAAG CATTAACATGAAC TTAAATT CATCAAGG CTAAT CT CTAT AT 
TTGCCTTGTGAGTTTTCTTTTGTGTTAGTTCTTTTAATAACCACTCATAAATCCTCATAGAGTATTTGTTTTCAAAAGAC 
TTAACATGTTCCAGATTATATTTTATGAATTTTTTTAACTGGAAAAGATAAGGCAATATCTCTTCACTAAAAACTAATTC 
TAATTTTTCGCTTGAGAACTTGGCATAGTTTGTCCACTGGAAAATCTCAAAGCCTTTAACCAAAGGATTCCTGATTTCCA 
CAGTTCTCGTCATCAGCTCTCTGGTTGCTTTAGCTAATACACCATAAGCATTTTCCCTACTGATGTTCATCATCTGAGCG 
TATTGGTTATAAGTGAACGATACCGTCCGTTCTTTCCTTGTAGGGTTTTCAATCGTGGGGTTGAGTAGTGCCACACAGCA 
TAAAATTAGCTTGGTTTCATGCTCCGTTAAGTCATAGCGACTAATCGCTAGTTCATTTGCTTTGAAAACAACTAATTCAG 
ACATACATCTCAATTGG T CTAGGTGATTTTAATCACTATAC CAATTG AGATGGG CTAG T CAATGATAATTACTAG T C CTT 
TTCCTTTGAGTTGTGGGTATCTGTAAATTCTGCTAGACCTTTGCTGGAAAACTTGTAAATTCTGCTAGACCCTCTGTAAA 
TTCCGCTAGACCTTTGTGTGTTTTTTTTGTTTATATTCAAGTGGTTATAATTTATAGAATAAAGAAAGAATAAAAAAAGA 
TAAAAAGAATAGAT CCCAG C CCTGTG TAT AACTCACTACTTTAGTCAGTT CCG CAG TATTACAAAAGGATGTCG CAAACG 
CTGTTTGCTCCTCTACAAAACAGACCTTAAAACCCTAAAGGCTTAAGTAGCACCCTCGCAAGCTCGGGCAAATCGCTGAA 
TATTCCTTTTGTCTCCGACCATCAGGCACCTGAGTCGCTGTCTTTTTCGTGACATTCAGTTCGCTGCGCTCACGGCTCTG 
G CAGTGAATGGGGGTAAATGG CACTACAGG CGCCTTTT ATGGATTCATG CAAGG AAACTAC CCATAATACAAG AAAAGCC 
CGTCACGGGCTTCTCT^GGGCGTTTTATGGCGGGTCTGCTATGTGGTGCTATCTGACTTTTTGCTGTTCAGCAGTTCCTGC 
C CTCTGATTTT C CAGTCTGAC CAC TT CGGATTATCC CGTGACAGGTCATTCAGACTGG CTAATGCAC C CAG TAAGG CAG C 
GGTATCATCAACAGGCTTACCCGTCTTACTGTCAAC 
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SEQ ID NO: 74 - pOTP71 

CCATCGAATGGCCAGATGATTAATTCCTAATTTTTGTTGACACTCTATCATTGATAGAGTTATTTTACCACTCCCTA 

TCAGTGATAGAGAAAAGTGAAATGAATAGTTCGACAAAAATCTAGAAAAGGAGGAATTTAAaTGTTACTGGTTATCG 

ATGTGGGGAACACCAATACTGTACTTGGTGTATATCATGATGGAAAATTAGAATATCACTGGCGTATAGAAACAAGC 

AGGCATAAAACAGAAGATGAGTTTGGGATGATTTTGCGCTCCTTATTTGATCACTCCGGGCTTATGTTTGAACAGAT 

AGATGGCATTATTATTTCGTCAGTAGTGCCGCCAATCATGTTTGCGTTAGAAAGAATGTGCACAAAATACTTTCATA 

TCGAGCCTCAAATTGTTGGTCCAGGTATGAAAACCGGTTTAAATATAAAATATGACAATCCGAAAGAAGTAGGGGCA 

GACAGAATCGTAAATGCTGTCGCTGCGATACACTTGTACGGCAATCCATTAATTGTTGTCGATTTCGGAACCGCCAC 

AACGTACTGCTATATTGATGAAAACAAACAATACATGGGCGGGGCGATTGCCCCTGGGATTACAATTTCGACAGAGG 

CGCTTTACTCGCGTGCAGCAAAGCTTCCTCGTATCG7VAATCACCCGGCCCGACAATATTATCGGAAAAAACACTGTT 

AGCGCGATGCAATCTGGAATTTTATTTGGCTATGTCGGCCAAGTGGAAGGAATCGTTAAGCGAATGAT^ATGGCAGGC 

AAAACAGGACcCCAAGGTCATTGCGACAGGAGGCCTGGCGCCGCTCATTGCGAACGAATCAGATTGTATAGACATCG 

TTGATCCATTCTTAACCCTAAAAGGGCTGGAATTGATTTATGAAAGAAACCGCGTAGGAAGTGTATAaGGATCCCTC 

GAGGTCGACCTGCAGGGGGACCATGGTCTCAGCGCTTGGAGCCACCCGCAGTTCGAAAAATT^ATAAGCTTGACCTGT 

GAAGTGAAAAATGGCGCACATTGTGCGACATTTTTTTTGTCTGCCGTTTACCGCTACTGCGTCACGGATCTCCACGC 

GCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAG 

CGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGG 

CTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAG 

TGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCC 

AAACTGGAACAACACTC/^ACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGG 

TTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTCAGGTGGCAC 

TTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGAC 

AATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATT 

CCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCA 

GTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAAC 

GTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAA 

CTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACG6ATGG 

CATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGA 

TCGGAGGA.CCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCG 

GAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACT 

ATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTGATAGACTGGATGGAGGCGGATAAAGTTGCAGGAC 

CACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGCTCTCGCGGT t 

ATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTAT^ 

GGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAGGAATTAATGATGTCTCGTT 

TAGATAAAAGTAAAGTGATTAACAGCGCATTAGAGCTGCTTAATGAGGTCGGAATCGAAGGTTTAACAACCCGTAAA 

CTCGCCCAGAAGCTAGGTGTAGAGCAGCCTACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCCTT 

AGCCATTGAGATGTTAGATAGGCACCATACTCACTTTTGCCCTTTAGAAGGGGAAAGCTGGCAAGATTTTTTACGTA 

ATAACGCTAAAAGTTTTAGATGTGCTTTACTAAGTCATCGCGATGGAGCAAAAGTACATTTAGGTACACGGCCTACA 

GAAAAACAGTATGAAACTCTCGAAAATCAATTAGCCTTTTTATGCCAACAAGGTTTTTCACTAGAGAATGCATTATA 

TGCACTCAGCGCAGTGGGGCATTTTACTTTAGGTTGCGTATTGGAAGATCAAGAGCATCAAGTCGCTAAAGAAGAAA 

GGGAAACACCTACTACTGATAGTATGCCGCCATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTGCAGAG 

CCAGCCTTCTTATTCGGCCTTGAATTGATCATATGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCTTAAAA 

GCAGCATAACCTTTTTCCGTGATGGTAACTTCACTAGTTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCA 

TGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGA 

GATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGA 

TCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGT 

AGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTG 

GCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGAT7VAGGCGCAGCGGTC 

GGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTG 

AGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGA 

GAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGA 

GCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCC 

TGGCCTTTTGCTGGCCTTTTGCTCACATGACCCGACA 



SEQ ID NO: 75 - pOTP72 

CCATCGAATGGCCAGATGATTAATTCCTAATTTTTGTTGACACTCTATCATTGATAGAGTTATTTTACCACTCCCTA 
TCAGTGATAGAGAAAAGTGAAATGAATAGTTCGACAAAAATCTAGAAAAGGAGGAATTTAAATGCCAGCTAGGCAAT 
CTTTTACAGATTTGAAAAACCTGGTTTTGTGCGATATAGGCAACACGCGTATCCATTTTGCACAAAACTATCAGCTC 
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TTTTCAAGCGCTAAAGAAGATTTAAAGCGTTTGGGTATTCAAAAGGAAATTTTTTACATTAGCGTGAATGAAGAAAA 
„„,.-•■» ^mm^mKo^mB.^cpnnRBSTBTTcrafznriTTTTTTrATTTAGAAACCGACTATGTAG 



TTTTCAAGCGCTAAAGAAtjATT TflAAlalAji TTbbbl hi n,nnnnuonnni ' " — 

TGAAAAAGCCCTTTTGAATTGTTACCCTAACGCTAAAAATATTGCAGGGTTTTTTCATTTAGAAACCGACTATGTAG 

GGCTTGGGATAGACCGGCAAATGGCGTGTCTGGCGGTAAATAATGGCGTGGTGGTGGATGCCGGGAGTGCGATTACG 

ATAGATTTAATCAAAGAGGGCAAGCATTTAGGAGGGTGTATTTTACCCGGTTTAGCCCAATATATTCATGCGTATAA 

AAAAAGCGCTAAAATTTTAGAGCAACCTTTCAAGGCCTTAGATTCTTTAGAAGTTTTACCTAAAAGCACTAGAGACG 

CTGTGAATTACGGCATGGTTTTGAGCGTCATTGCTTGTATCCAGCATTTAGCCAAAAATCAAAAAATCTATCTTTGT 

GGGGGCGATGCGAAGTATTTGAGCGCGTTTTTACCCCATTCTGTTTGCAAGGAGCGTTTGGTTTTTGACGGGATGGA 

. , . , .,^o^^o^^^»r B A-rr.rBoaTiisr:r.nTrrrTrRAr,r,TreAf:CTGCAGGGGGACCATGGTC 



AATCGCTCTTAAAAAAGCAGGGATACTAGAATGCAAATAAGGATCCCTCGAGGTCGACCTGCAGGGGGACCATGGTC 
TCAGCGCTTGGAGCCACCCGCAGTTCGAAAAATAATAAGCTTGACCTGTGAAGTGAAAAATGGCGCACATTGTGCGA 
CATTTTTTTTGTCTGCCGTTTACCGCTACTGCGTCACGGATCTCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCG 
GGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTC 
CTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTT 
TACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTT 
CGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTC 
GGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAAT 
TTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT 
ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATA 
TTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTG 
TTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAA 
CTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGT 
TCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGA 
ATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCT 
GCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTT 
TTTGCAGAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACG 
AGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCT 
TCCCGGCAACAATTGATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGG 
CTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGCTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTA 
AGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAG 
ATAGGTGCCTCACTGATTAAGCATTGGTAGGAATTAATGATGTCTCGTTTAGATAAAAGTAAAGTGATTAACAGCGC 
ATTAGAGCTGC^TAATGAGGTCGGAATCGAAGGTTTAACAACCCGTAAACTCGCCCAGAAGCTAGGTGTAGAGCAGC 
CTACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCCTTAGCCATTGAGATGTTAGATAGGCACCAT 
ACTCACTTTTGCCCTTTAGAAGGGGAAAGCTGGCAAGATTTTTTACGTAATAACGCTAAAAGTTTTAGATGTGCTTT 
ArTAAGTCATCG^GATGGAGCAAAA'GTACATTTAGGTACACGGCCTACAG/^AAAACAGTATGAAACTCTCGAAAATC 
AATTAGCCTTTTTATGCCAACAAGGTTTTTCACTAGAGAATGCATTATATGCACTCAGCGCAGTGGGGCATTTTACT 
TTAGG^TGCGTATTGGAAGATCAAGAGCATCAAGTCGCTAAAGAAGAAAGGGAAACACCTACTACTGAT.AGTATGCC 
GCCATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTGCAGAGCCAGCCTTCTTATTCGGCCTTGAATTGA. 
TCAT^TGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCTTAAAAGCAGCATAACCTTTTTCCGTGATGGTAA 
CTTCACTAGTTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTC 
GTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCT 
GCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAG 
GTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAA 
CTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTC 
TTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAG 
CCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGA 
AGGGAGAAAGGCGGACAGGTATCCGGTAAGGGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAA 
ACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGG 
GGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACAT 

GACCCGACA 

SEQ ID NO: 76 - pOTP73 

CCATCGAATGGCCAGATGATTAATTCCTAATTTTTGTTGACACTCTATCATTGATAGAGTTATTTTACCACTCCCTA 
TCAGTGATAGAGAAAAGTGAAATGAATAGTTCGACAAAAATCTAGAAAAGGAGGAATTTAAATGATTCTTGAGCTCG 
ACTGTGGAAACTCGCTGATCAAGTGGCGGGTCATCGAGGGGGCGGCGCGGTCGGTCGCCGGTGGCCTTGCGGAGTCC 
GATGATGCCCTGGTCGAACAGTTAACGTCGCAGCAAGCGCTGCCAGTGCGAGCCTGTCGCCTGGTGAGCGTTCGCAG 
CGAGCAGGAAACCTCGCAACTGGTCGCACGGTTGGAGCAGCTGTTCCCGGTTTCGGCGCTGGTTGCATCATCCGGCA 
AGCAGTTGGCGGGTGTGCGCAACGGCTATCTCGATTACCAGCGCCTGGGGCTCGACCGCTGGCTGGCCCTCGTCGCG 
GCTCATCACCTGGCTAAGAAGGCCTGCCTGGTCATTGATCTGGGGACCGCGGTCACCTCTGACCTGGTCGCGGCGGA 
TGGAGTGCATCTGGGGGGCTACATATGCCCGGGCATGACCCTGATGAGAAGCCAGTTGCGCACCCATACCCGACGTA 
TCCGCTACGACGATGCAGAGGCCCGGCGGGCGCTTGCCAGTCTCCAGCCAGGGCAGGCCACGGCCGAGGCGGTTGAG 
CGGGGTTGTCTGCTCATGCTCAGGGGGTTCGTTCGTGAGCAGTACGCCATGGCGTGCGAGCTGCTCGGTCCGGATTG 
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TGAAATATTCCTGACGGGTGGGGATGCCGAACTGGTTCGCGACGAACTGGCTGGCGCCCGGATCATGCCGGACCTGG 
TTTTCGTAGGGCTGGCACTGGCTTGCCCGATTGAGTAAGGATCCCTCGAGGTCGACCTGCAGGGGGACCATGGTCTC 
AGCGCTTGGAGCCACCCGCAGTTCGAAAAATAATAAGCTTGACCTGTGAAGTGAAAAATGGCGCACATTGTGCGACA 
TTTTTTTTGTCTGCCGTTTACCGCTACTGCGTCACGGATCTCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGG 
TGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCT 
TTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTA 
CGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCG 
CCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGG 
TCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTT 
AACGCGAATTTTAACAAAATATTAACGCTTACAATTTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTAT 
TTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATT 
GAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTT 
TTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACT 
GGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTC 
TGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAAT 
GACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGC 
CATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTT 
TGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 
CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTC 
CCGGCAACAATTGATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCT 
GGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGCTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAG 
CCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGAT 
AGGTGCCTCACTGATTAAGCATTGGTAGGAATTAATGATGTCTCGTTTAGATAAAAGTAAAGTGATTAACAGCGCAT 
TAGAGCTGCTTAATGAGGTCGGAATCGAAGGTTTAACAACCCGTAAACTCGCCCAGAAGCTAGGTGTAGAGCAGCCT 
ACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCGTTAGCCATTGAGATGTTAGATAGGCACCATAC 
TCACTTTTGCCCTTTAGAAGGGGAAAGCTGGCAAGATTTTTTACGTAATAACGCTAAAAGTTTTAGATGTGCTTTAC 
TAAGTCATCGCGATGGAGCAAAAGTACATTTAGGTACACGGCCTACAGAAAAACAGTATGAAACTCTCGAAAATCAA 
TTAGCCTTTTTATGCCAACAAGGTTTTTCACTAGAGAATGCATTATATGCACTCAGCGCAGTGGGGCATTTTACTTT 
AGGTTGCGTATTGGAAGATCAAGAGCATCAAGTCGCTAAAGAAGAAAGGGAAACACCTACTACTGATAGTATGCCGC 
■CATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTGCAGAGCCAGCCTTCTTATTCGGCC.TTGAATTGATC 
ATATGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCTTAAAAGCAGCATAACCTTTTTCCGTGATGGTAACT 
TCACTAGTTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGT- 
TCCACTGAGCGTCAGACCCCGTAGAAAAGATGAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC 
TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGT 
AACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACT 
CTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTT 
ACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCA T CACAGCC 
CAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAG 
GGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAAC 
GCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGG 
GCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGA 



CCCGACA 



